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Lyme Disease Vaccines 



Field of the Invention 

The present invention relates to novel vaccines for the prevention or attenuation of Lyme 
disease. The invention further relates to isolated nucleic acid molecules encoding antigenic 
polypeptides of Borrelia burgdorferi. Antigenic polypeptides are also provided, as are vectors, 
host cells and recombinant methods for producing the same. The invention additionally relates to 
diagnostic methods for detecting Borrelia gene expression. 



Background of the Invention 

Lyme disease (Steere, A.C., Proc. Natl Acad. Sci. USA 97:2378-2383 (1991)), or Lyme 
borreliosis, is presently the most common human disease in the United States transmitted by an 
arthropod vector (Center for Disease Control, Morbid. Mortal. Weekly Rep. 46(23):53l-535 
(1997)). Further, infection of house-hold pets, such as dogs, is a considerable problem. 

While initial symptoms often include a rash at the infection point, Lyme disease is a 
multisystemic disorder that may include arthritic, carditic, and neurological manifestations. While 
antibiotics are currently used to treat active cases of Lyme disease, B. burgdorferi persists even 
after prolonged antibiotic treatment. Further, B. burgdorferi can persist for years in a mammalian 
host in the presence of an active immune response (Straubinger, R. et al, J. Clin. Microbiol 
35:111-116 (1997); Steere, A., N. Engl. J. Med. 327:586-596 (1989)). 

Lyme disease is caused by the related tick-borne spirochetes classified as Borrelia 
burgdorferi sensu lato (including B. burgdorferi sensu stricto, B. afzelii, B. garinii). Although 
substantial progress has been made in the biochemical, uitrastructural, and genetic characterization 
of the organism, the spirochetal factors responsible for infectivity, immune evasion and disease 
pathogenesis remain largely obscure. 

A number of antigenic B. burgdorferi cell surface proteins have been identified. These 
include the outer membrane surface proteins (Osp) OspA, OspB, OspC and OspD. OspA and 
OspB are encoded by tightly linked tandem genes which are transcribed as a single transcriptional 
unit (Brusca, J. et al, J. Bacteriol. 773:8004-8008 (1991)). The most-studied B. burgdorferi 
membrane protein is OspA, a lipoprotein antigen expressed by borreliae in resting ticks and the 
most abundant protein expressed in vitro by most borrelial isolates (Barbour, A.G., et al, 
Infection & Immunity 47:795-804 (1983); Howe, T.R., et al, Science 227:645 (1985)). 
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A number of different types of Lyme disease vaccines have been shown to induce 
immunological responses. Whole-cell B. burgdorferi vaccines, for example, have been shown to 
induce both immunological responses and protective immunity in several animal models 
(Reviewed in Wormser, G., Clin. Infect. Dis. 27:1267-1274 (1995)). Further, passive immunity 
5 has been demonstrated in both humans and other animals using B. burgdorferi specific antisera. 

While whole-cell Lyme disease vaccines confer protective immunity in animal models, use 
of such vaccines presents the risk that responsive antibodies will produce an autoimmune 
response (Reviewed in Wormser, G., supra). This problem is at least partly the result of the • 
production of B. burgdorferi specific antibodies which cross-react with hepatocytes and both 
10 muscle and nerve cells. B. burgdorferi heat shock proteins and the 41-kd fiagellin subunit are 
believed to contain antigens which elicit production of these cross-reactive antibodies. 

Single protein subunit vaccines for Lyme disease have also been tested. The cell surface 
if | proteins of B. burgdorferi are potential candidates for use in such vaccines and several have been 
W shown to elicit protective immune responses in mammals (Probert, W. et al., Vaccine 75:15-19 
If (1997); Fikrig, E. et al, Infect. Immun. (53:1658-1662 (1995); Langerman S. et al, Nature 
W 372:552-556 (1994); Fikrig, E. et al, J. Immunol. 745:2256-2260 (1992)). Experimental OspA 
~~. vaccines, for example, have demonstrated efficacy in several animal models (Fikrig, E., et al, 
** Proc. Natl. Acad. Sci. USA 59:5418-5421 (1992); Johnson, B.J., et al, Vaccine 73:1086-1094 
§. (1996); Fikrig, E., et al, Infect. Immun. 60:657-661 (1992); Chang, Y.F., et al, Infection & 
Immunity 63:3543-3549 (1995)), and OspA vaccines for human use are under clinical evaluation 
(Keller, D., et al, J. Am. Med. Assoc. 277:1764-1768 (1994); Van Hoecke, C., et al, Vaccine 
uf 14: 1620-1626 (1996)). Passive immunity is also conferred by antisera containing antibodies 

specific for the full-length OspA protein. Further, vaccination with plasmid DNA encoding OspA 
has been demonstrated to elicit protective immune responses in mice (Luke, C. et al, J. Infect. 
25 Dis. 775:91-97 (1997); Zhong,~W. et al, Eur. J. Immunol 26:2749-2757 (1996)). 

Recent immunofluorescence assay observations indicate that during tick engorgement the 
expression of OspA by borreliae diminishes (deSilva, A.M., et al, J. Exp. Med. 753:271-275 
(1996)) while expression of other proteins,' exemplified by OspC, increases (Schwan, T.G., et 
al, Proc. Natl. Acad. Sci. USA 92:2909-2913 (1985)). By the time of transmission to hosts, 
30 spirochetes in the tick salivary glands express little or no OspA. This down-modulation of OspA 
appears to explain the difficulties in demonstrating immune responses to this antigen early in 
infection following tick bites (Kalish, R.A., et al, Infect. Immun. 63:2228-2235 (1995); Gern, 
L., etal, J. Infect. Dis. 167:911-915 (1993); Schiable, U.E., et al, Immunol Lett. 36:219-226 
(1993)) or following challenge with limiting doses of cultured borreliae (Schiable, U.E., et al, 
35 Immunol. Lett. 36:219-226 (1993); Barthold, S.W 7 . and Bockenstedt, L.K., Infect. Immun. 
(57:4696-4702 (1993)). 

Furthermore, OspA-specific antibodies are ineffective if administered after a borrelial 
challenge delivered by syringe (Schiable, U.E., et al, Proc. Natl. Acad. Sci. USA 57:3768-3772 
(1990)) or tick bite (deSilva, A.M., et al, J. Exp. Med. 753:271-275 (1996)). To be efficacious, 
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OspA vaccines must elicit protective levels of antibody which are maintained throughout periods 
of tick exposure in order to block borrelia transmission from the arthropod vector. 

Vaccines in current use against other pathogens include in vzvo-expressed antigens which 
could boost anamnestic responses upon infection, potentiate the action of immune effector cells 

5 and'complement, and inhibit key virulence mechanisms. OspC is both expressed during infection 
(Montgomery, R.R., et al, J. Exp. Med. 183:261-269 (1996)) and a target for protective 
immunity (Gilmore, R.D., et al, Infect. Immun. (54:2234-2239 (1996); Probert, W.S. and 
LeFebvre, R.B., Infect. Immun. (52:1920-1926 (1994); Preac-Mursic, V., et al, Infection 
20:342-349 ( 1992)), but mice immunized with -this protein were only protected against challenge 

10 with the homologous borrelial isolate (Probert, W.S., et al, J. Infect. Dis. 775:400-405 (1997)). 
Identification of in vz'vo-expressed, and broadly protective, antigens of B. burgdorferi has 
remained elusive. 

Summary of the Invention 

B The present invention provides isolated nucleic acid molecules comprising polynucleotides 

encoding the B. burgdorferi peptides having the amino acid sequences shown in Table 1. Thus, 
one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides 
having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1; (b) a 

:10 nucleotide sequence encoding any of the amino acid sequences of the full-length polypeptides 
shown in Table 1 but minus the N-terminal methionine residue, if present; (c) a nucleotide 

: sequence encoding any of the amino acid sequences of the truncated polypeptides shown in Table 
1; and (d) a nucleotide sequence complementary to any of the nucleotide sequences in (a), (b), or 
(c) above. 

25 Further embodiments of the invention include isolated nucleic acid molecules that 

comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more 
preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in 
(a), (b), (c), or (d) above, or a polynucleotide which hybridizes under stringent hybridization 
, conditions to a polynucleotide in (a), (b), (c), or (d) above. This polynucleotide which hybridizes 

30 does not hybridize under stringent hybridization conditions to a polynucleotide having a 

nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid 
embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides 
which encode the amino acid sequences of epitope-bearing portions of a B. burgdorferi 
polypeptide having an amino acid sequence in (a), (b), or (c) above. 

35 . The present invention also relates to recombinant vectors, which include the isolated 

nucleic acid molecules of the present invention, and to host cells containing the recombinant 
vectors, as well as to methods of making such vectors and host cells and for using these vectors 
for the production of B. burgdorferi polypeptides or peptides by recombinant techniques. 

The invention further provides isolated B. burgdorferi polypeptides having an amino acid 
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sequence selected from the group consisting of: (a) an amino acid sequence of any of the full- 
length polypeptides shown in Table 1; (b) an amino acid sequence of any of the full-length 
polypeptides shown in Table 1 but minus the N-terminal methionine residue, if present; (c) an 
amino acid sequence of any of the truncated polypeptides shown in Table 1 ; and (d) an amino acid 
sequence of an epitope-bearing portion of any one of the polypeptides of (a), (b), or (c). 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% similarity to those described in (a), (b), (c), or (d) above, as well as 
polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% 
identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
those above; as well as isolated nucleic acid molecules encoding such polypeptides. 

The present invention further provides a vaccine, preferably a multi-component vaccine 
comprising one or more of the B. burgdorferi polypeptides shown in Table 1, or fragments 
thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the 
B. burgdorferi polypeptide(s) are present in an amount effective to elicit an immune response to 
members of the Borrelia genus in an animal. The B. burgdorferi polypeptides of the present 
invention may further be combined with one or more immunogens of one or more other borrelial 
or non-borrelial organisms to produce a multi-component vaccine intended to elicit an 
immunological response against members of the Borrelia genus and, optionally, one or more non- 
borrelial organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" 
DNA, wherein the DNA encodes one or more borrelial polypeptides and, optionally, one or more 
polypeptides of a non-borrelial organism. The DNA encoding one or more polypeptides may be 
constructed such that these polypeptides are expressed fusion proteins. 

The vaccines of the present invention may also be administered as a component of a 
genetically engineered organism. Thus, a genetically engineered organism which expresses one 
or more B. burgdorferi polypeptides may be administered to an animal. For example, such a 
genetically engineered organism may contain one or more B. burgdorferi polypeptides of the 
present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a 
genetically engineered organism may secrete one or more B. burgdorferi polypeptides. 

The vaccines of the present invention may be co-administered to an animal with an 
immune system modulator {e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response in an animal 
to one or more members of the Borrelia genus, e.g., B. burgdorferi sensu stricto, B. afzelii, and 
B. garinii, comprising administering to the animal a vaccine as described above. 

The invention further provides a method of inducing a protective immune response in an 
animal, sufficient to prevent or attenuate an infection by members of the Borrelia genus, 
comprising administering to the animal a composition comprising one or more of the polypeptides 
shown in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may 




be conjugated to another immunogen and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies elicited in an animal by the administration of one 
or more B. burgdorferi polypeptides of the present invention. 

The invention also provides diagnostic methods for detecting the expression of genes of 
members of the Borrelia genus in an animal. One such method involves assaying for the 
expression of a gene encoding Borrelia peptides in a sample from an animal. This expression 
may be assayed either directly {e.g., by assaying polypeptide levels using antibodies elicited in 
response to amino acid sequences shown in Table 1) or indirectiy {e.g., by assaying for 
antibodies having specificity for amino acid sequences shown in Table 1). An example of such a 
method involves the use of the polymerase chain reaction (PCR) to amplify and detect Borrelia 
nucleic acid sequences. 

The present invention also relates to nucleic acid probes having all or part of a nucleotide 
sequence shown in Table 1 which are capable of hybridizing under stringent conditions to 
Borrelia nucleic acids. The invention further relates to a method of detecting one or more Borrelia 
nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids 
encoding Borrelia polypeptides, comprising: 

a) contacting the sample with one or more of the above-described nucleic acid probes, 
under conditions such that hybridization occurs, and 

b) detecting hybridization of said one or more probes to the Borrelia nucleic acid present 
in the biological sample. 

Detailed Description 

The present invention relates to recombinant antigenic B. burgdorferi polypeptides and 
fragments thereof. The invention also relates to methods for using these polypeptides to produce 
immunological responses and to confer immunological protection to disease caused by members 
of the genus Borrelia. The invention further relates to nucleic acid sequences which encode 
antigenic B. burgdorferi polypeptides and to methods for detecting Borrelia nucleic acids and 
polypeptides in biological samples. The invention also relates to Borrelia specific antibodies and 
methods for detecting such antibodies produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter which the inventors 
consider to be the present invention. 

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state 
or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, 
fungi, viruses and metazoan parasites which either produce a disease state or render an animal 
infected with such an organism susceptible to a disease state {e.g., a secondary infection). 
Further included are species and strains of the genus Borrelia which produce disease states in 
animals. 
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As used herein, the term "organism" means any living biological system, including 
viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Borrelia" means any species or strain of bacteria which is 
members of the genus Borrelia. Included within this definition are Borrelia burgdorferi sensu lato 
(including B. burgdorferi sensu stricto, B. afzelii, B. garinii), B. andersonii, B. anserina, B. 
japonica, B. coriaceae, and other members of the genus Borrelia regardless of whether they are 
known pathogenic agents. 

As used herein, the phrase "one or more B. burgdorferi polypeptides of the present 
invention" means the amino acid sequence of one or more of the B. burgdorferi polypeptides 
disclosed in Table 1 . These polypeptides may be expressed as fusion proteins wherein the 
B. burgdorferi polypeptides of the present invention are linked to additional amino acid 
sequences which may be of borrelial or non-borrelial origin. This phrase further includes 
fragments of the B. burgdorferi polypeptides of the present invention. 

As used herein, the phrase "full-length amino acid sequence" and "full-length polypeptide" 
refer to an amino acid sequence or polypeptide encoded by a full-length open reading frame 
(ORF). An ORF may be defined as a nucleotide sequence bounded by stop codons which 
encodes a putative polypeptide. An ORF may also be defined as a nucleotide sequence within a 
stop codon bounded sequence which contains an initiation codon (e.g., a methionine or valine 
codon) on the 5' end and a stop codon on the 3' end. 

As used herein, the phrase "truncated amino acid sequence" and "truncated polypeptide" 
refer to a sub-sequence of a full-length amino acid sequence or polypeptide. Several criteria may 
also be used to define the truncated amino acid sequence or polypeptide. For example, a truncated 
polypeptide may be defined as a mature polypeptide (e.g., a polypeptide which lacks a leader 
sequence). A truncated polypeptide may also be defined as an amino acid sequence which is a 
portion of a longer sequence that has been selected for ease of expression in a heterologous 
system but retains regions which render the polypeptide useful for use in vaccines (e.g., antigenic 
regions which are expected to elicit a protective immune response). 

Additional definitions are provided throughout the specification. 
Explanation of Table 1 

Table 1 lists B. burgdorferi nucleotide and amino acid sequences of the present invention. 
The nomenclature used therein is as follows: 

"nt" refers to nucleotide sequences; 

"aa" refers to amino acid sequences; 

"f" refers to full-length nucleotide or amino acid sequences; and 
"t" refers to truncated nucleotide or amino acid sequences. 

Thus, for example, the designation "f 101. aa" refers to the full-length amino acid sequence 
of B. burgdorferi polypeptide number .101. Further, "flOl.nt" refers to the full-length nucleotide 
sequence encoding the full-length amino acid sequence of B. burgdorferi polypeptide number 
101. 



Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between the 
polypeptides of the present invention and those available through GenBank and GeneSeq 
databases. These reference numbers are the database entry numbers commonly used by those of 
skill in the art, who will be familar with their denominations. The descriptions of the 
numenclature for GenBank are available from the National Center for Biotechnology Information. 
Column 1 lists the gene or ORF of the present invention. Column 2 lists the accession number of 
a "match" gene sequence in GenBank or GeneSeq databases. Column 3 lists the description of 
the "match" gene sequence. Columns 4 and 5 are the high score and smallest sum probability, 
respectively, calculated by BLAST. Polypeptides of the present invention that do not share 
significant identity/similarity with any polypeptide sequences of GenBank and GeneSeq are not 
represented in Table 2. Polypeptides of the present invention that share significant 
identity/similarity with more than one of the polypeptides of GenBank and GeneSeq are 
represented more than once. 

Explanation of Table 3. 

The B. burgdorferi polypeptides of the present invention may include one or more conservative 
amino acid substitutions from natural mutations or human manipulation as indicated in Table 3. 
Changes are preferably of a minor nature, such as conservative amino acid substitutions that do 
not significantly affect the folding or activity of the protein. Residues from the following groups, 
as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, 
Acidic, and Small, 

Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing fragments 
present in each of the full length B. burgdorferi polypeptides described in Table 1 as predicted by 
the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. 
The Jameson- Wolf antigenic analysis was performed using the computer program PROTEAN 
(Version 3.1 1 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, 
WI). B. burgdorferi polypeptide shown in Table 1 may one or more antigenic epitopes 
comprising residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic deterrninants, the exact address of the determinant may vary 
slightly. The residues and locations shown described in Table 4 correspond to the amino acid 
sequences for each full length gene sequence shown in Table 1 and in the Sequence Listing. 
Polypeptides of the present invention that do not have antigenic epitopes recognized by the 
Jameson- Wolf algorithm are not represented in Table 2. 
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Selection of Nucleic Acid Sequences Encoding Antigenic B. burgdorferi 
Polypeptides 

The present invention provides a select number of ORFs from those presented in the 
fragments of the Borrelia burgdorferi genome which may prove useful for the generation of a 
protective immune response. The sequenced B. burgdorferi genomic DNA was obtained from a 
sub-cultured isolate of ATCC Deposit No. 35210. The sub-cultured isolate was deposited on 
August 8, 1997 at the American Type Culture Collection, 12301 Park Lawn Drive, Rockville, 
Maryland 20852, and given accession number 202012. 

Some ORFs contained in the subset of fragments of the B. burgdorferi genome disclosed 
herein were derived through the use of a number of screening criteria detailed below. The ORFs 
are generally bounded at the amino terminus by a methionine residue and at the carboxy terminus 
by a stop codon. 

Many of the selected sequences do not consist of complete ORFs. Although a polypeptide 
representing a complete ORF may be the closest approximation of a protein native to an organism, 
it is not always preferred to express a complete ORF in a heterologous system. It may be 
challenging to express and purify a highly hydrophobic protein by common laboratory methods. 
Some of the polypeptide vaccine candidates described herein have been modified slightly to 
simplify the production of recombinant protein. For example, nucleotide sequences which encode 
highly hydrophobic domains, such as those found at the amino terminal signal sequence, have 
been excluded from some constructs used for in vitro expression of the polypeptides. 
Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus 
have also been excluded from the recombinant expression constructs. Thus, in one embodiment, 
a polypeptide which represents a truncated or modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially immunogenic 
polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all 
theoretical Borrelia burgdorferi ORFs for several aspects of potential immunogenicity. One set of 
selection criteria are as follows: 

1 . Type I signal sequence: An amino terminal type I signal sequence generally directs a 
nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. 
Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I 
signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and 
Kendall, D. A. Mol. Microbiol. 13:165-113 (1994)). The length of the type I signal sequence is 
approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in 
the extreme amino terminus. In addition, the central region of the signal sequence adopts an 
alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the 
actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and 
-3 positions. 

2. Type TV signal sequence: The type IV signal sequence is an example of the several 
types of functional signal sequences which exist in addition to the type I signal sequence detailed 
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above. Although functionally related, the type IV signal sequence possesses a unique set of 
biochemical and physical attributes (Strom, M. S. and Lory, S., J. Bacteriol. 774:7345-7351 
(1992)). These are typically six to eight amino acids with a net basic charge followed by an 
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal 
5 sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In 
addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative 
to the cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprote in precursors 
has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly 

10 three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)- 
(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C, J. 
g Bioenerg. Biomembr. 22:451-471 (1990)). 

4. LPXTG motif: It has been experimentally determined that most anchored proteins 
y?>* found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal 

?f ij> sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, B. 
W burgdorferi, S. pneumoniae, and others, have been identified based on their extracellular location 
% f and carboxy terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410 (1996)). 
53 The conserved region consists of six charged amino acids at the extreme carboxy terminus 
~l~ coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. 
•2Q Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in 
Q nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X 
is any amino acid. 

An algorithm for selecting antigenic and immunogenic Borrelia burgdorferi polypeptides 
including the foregoing criteria was developed. The algorithm is similar to that described in U.S. 
25 patent application 08/78 1 ,986, filed January 3, 1997, which is fully incorporated by reference 
herein.- Use of the algorithm by the inventors to select immunologically useful Borrelia 
burgdorferi polypeptides resulted in the selection of a number of the disclosed ORFs. 
Polypeptides comprising the polypeptides identified in this group may be produced by techniques 
standard in the art and as further described herein. 

30 

Nucleic Acid Molecules 

The present invention provides isolated nucleic acid molecules comprising polynucleotides 
encoding the B. burgdorferi polypeptides having the amino acid sequences shown in Table 1, 
which were determined by sequencing the genome of B. burgdorferi deposited as ATCC deposit 
35 no. 202012 and selected as putative immunogens. 

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA 
molecule herein were determined using an automated DNA sequencer (such as the Model 373 
from Applied Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of DNA sequences determined as 



above. Therefore, as is known in the art for any DNA sequence determined by this automated 
approach, any nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more typically at 
. least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the 
sequenced DNA molecule. The actual sequence can be more precisely determined by other 
approaches including manual DNA sequencing methods well known in the art. As is also known 
in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted 
amino acid sequence encoded by a determined nucleotide sequence will be completely different 
from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the 
point of such an insertion or deletion. 

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a 
sequence of deoxyribonucleotides (abbreviated A, G , C and T). However, by "nucleotide 
sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or 
polynucleotide, a sequence of deoxyribonucleotides, and for an RNA molecule or polynucleotide, 
the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine 
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the 
ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence of 
Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule 
having a sequence in which each deoxyribonucleotide A, G or C of Table 1 has been replaced by 
the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced 
by a ribonucleotide U. 

Nucleic acid molecules of the present invention may bejn the form of RNA, such as 
mRNA, or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by 
cloning or produced synthetically. The DNA may be double-stranded or single-stranded. 
Single-stranded DNA or RNA may be the coding strand, also known as the sense strand, or it 
may be the non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA. 
which has been removed from its native environment. For example, recombinant DNA molecules 
contained in a vector are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present 
invention. Isolated nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 

In addition, isolated nucleic acid molecules of the invention include DNA molecules which 
comprise a sequence substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode a B. burgdorferi polypeptides and peptides of the 
present invention (e.g. polypeptides of Table 1). That is, all possible DNA sequences that encode 
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the B, burgdorferi polypeptides of the present invention. This includes the genetic code and 
species-specific codon preferences known in the art. Thus, it would be routine for one skilled in 
the art to generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those preferred by a 
5 mammalian or other bacterial host such as E. coli). 

The invention further provides isolated nucleic acid molecules having the nucleotide 
sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of 
the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes 
for gene mapping and for identifying B. burgdorferi in a biological sample, for instance, by PCR, 
. 10 Southern blot, Northern blot, or other form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding portions or 
fragments of the nucleotide sequences described herein. Fragments include portions of the 
nucleotide sequences of Table 1 at least 10 contiguous nucleotides in length selected from any two 
integers, one of which representing a 5' nucleotide position and a second of which representing a 

"15 3' nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is 

position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment at least 
10 contiguous nucleotides in length could occupy is included in the invention. "At least" means a 
fragment may be 10 contiguous nucleotide base's in length or any integer between 10 and the 
length of an entire nucleotide sequence of Table 1 minus 1. Therefore, included in the invention 

20 are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide 

sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of 
an entire nucleotide sequence minus 1. 

Further, the invention includes polynucleotides comprising fragments specified by size, in 
nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in 

25 contiguous nucleotides, selected from integers between 10 and the length of an entire nucleotide 
sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 
nucleotides, 40 nucleotides, 50 nucleotides. Other preferred sizes of contiguous nucleotide 
fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 
nucleotides in length which include, as discussed above, fragment sizes representing each integer 

30 between 50-300. Larger fragments are also useful according to the present invention 

corresponding to most, if not all, of the nucleotide sequences shown in Table lor of the B. 
burgdorferi nucleotide sequences of the plasimd clones listed in Table 1. The preferred sizes are, 
of course, meant to exemplify not limit the present invention as all size fragments, representing 
any integer between 10 and the length of an entire nucleotide sequence minus 1, are included in 

35 the invention. Additional preferred nucleic acid fragments of the present invention include 
nucleic acid molecules encoding epitope-beaiing portions of B. burgdorferi polypeptides 
. identified in Table 4. 

The present invention also provides for the exclusion of any fragment, specified by 5' and 
3' base positions or by size in nucleotide bases as described above for any nucleotide sequence of 
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Table 1 or the plasimd clones listed in Table 1. Any number of fragments of nucleotide sequences 
in Table 1 or the plasimd clones listed in Table 1, specified by 5' and 3' base positions or by size 
in nucleotides, as described above, may be excluded from the present invention. 

Preferred nucleic acid fragments of the present invention also include nucleic acid 
molecules encoding epitope -bearing portions of the B. burgdorferi polypeptides shown in Table 
1. Such nucleic acid fragments of the present invention include, for example, nucleic acid 
molecules encoding polypeptide fragments comprising from about the amino terminal residue to 
about the carboxy terminal residue of each fragment shown in Table 4. The above referred to 
polypeptide fragments are antigenic regions of particular B. burgdorferi polypeptides shown in 
Table 1. Methods for determining other such epitope-bearing portions for the remaining 
polypeptides described in Table 1 are well known in the art and are described in detail below. 

In another aspect, the invention provides isolated nucleic acid molecules comprising 
polynucleotides which hybridize under stringent hybridization conditions to a portion of a 
polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic 
acid sequence shown in Table 1. By "stringent hybridization conditions" is intended overnight 
incubation at 42 C in a solution comprising: 50% formamide, 5x SSC (150 mM NaCl, 15 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 
0.1xSSCatabout65C. 

By polynucleotides which hybridize to a "portion" of a polynucleotide is intended 
polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably about 30-70 nt of the reference polynucleotide. These are useful as diagnostic probes 
and primers as discussed above and in more detail below. 

Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, 
for instance, a portion 50-100 nt in length, or even to the entire length of the reference 
polynucleotide, are also useful as probes according to the present invention, as are 
polynucleotides corresponding to most, if not all, of a nucleotide sequence as shown in Table 1. 
By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more 
contiguous nucleotides from the nucleotide sequence of the reference polynucleotide (e.g., a 
nucleotide sequences as shown in Table 1). As noted above, such portions are useful 
diagnostically either as probes according to conventional DNA hybridization techniques or as 
primers for amplification of a target sequence by PCR, as described, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., 
eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire 
disclosure of which is hereby incorporated herein by reference. 

Since nucleic acid sequences encoding the B. burgdorferi polypeptides of the present 
invention are provided in Table 1, generating polynucleotides which hybridize to portions of these 
sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides 
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of the present invention could be generated synthetically according to known techniques. 

As indicated, nucleic acid molecules of the present invention which encode B. burgdorferi 
polypeptides of the present invention may include, but are not limited to those encoding the amino 
acid sequences of the polypeptides by themselves; and additional coding sequences which code 
5 for additional amino acids, such as those which provide additional functionalities. Thus, the 
sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence 
encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which 

10 ' are commercially available. As described in Gentz et al, Proc. Natl. Acad. Set USA <§<5:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the resulting fusion 

CI protein. 

Thus, the present invention also includes genetic fusions wherein the 
B. burgdorferi nucleic acid sequences coding sequences provided in Table 1 are linked to 
'fi additional nucleic acid' sequences to produce fusion proteins. These fusion proteins may include 
j^f epitopes of borrelial or non-borrelial origin designed to produce proteins having enhanced 
^ immunogenicity. Further, the fusion proteins of the present invention may contain antigenic 
PI determinants known to provide helper T-cell stimulation, peptides encoding sites for 
SQ post-translational modifications which enhance immunogenicity (e.g., acylation), peptides which 
|!| facilitate purification (e.g., histidine "tag"), or amino acid sequences which target the fusion 
r;;;| protein to a desired location (e.g., a heterologous leader sequence). For instance, hexa-histidine 
h * provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Natl. 

Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for purification which corresponds 
to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell 
25 37:767. As discussed below, other such fusion proteins include the B. burgdorferi polypeptides 
of the present invention fused to Fc at the N- or C-terminus. 

Post-translational modification of the full-length B. burgdorferi OspA protein 
expressed in E. coli is believed to increase the immunogenicity of this protein. Erdile, L. et al, 
Infect. Immun. (57:81-90 (1993). B. burgdorferi OspA when expressed in E. coli, for example, 
30 is post-translationally modified in at least two ways. First, a signal peptide is cleaved; second, 
lipid moieties are attached. The presence of these lipid moieties is believed to confer enhanced 
immunogenicity and results in the elicitation of a strong protective immunological response. 

Variant and Mutant Polynucleotides 
35 . The present invention thus includes nucleic acid molecules and sequences which encode 

. fusion proteins comprising one or more B. burgdorferi polypeptides of the present invention 
fused to an amino acid sequence which allows for pcst-translational modification to enhance 
immunogenicity. This post-translational modification may occur either in vitro or when the fusion 
protein is expressed in vivo in a host cell. An example of such a modification is the introduction 
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of an amino acid sequence which results in the attachment of a lipid moiety. Such a lipid moiety 
attachment site of OspA, which is lipidated upon expression in E. coli, has been identified. 
Bouchon, B. et al., Anal. Biochem. 246:52-61 (1997). 

Thus, as indicated above, the present invention includes genetic fusions wherein a 
5 B. burgdorferi nucleic acid sequence provided in Table 1 is linked to a nucleotide sequence 
encoding another amino acid sequence. These other amino acid sequences may be of borrelial 
origin {e.g., another sequence selected from Table 1) or non-borrelial origin. An example of such 
a fusion protein is reported in Fikrig, E. et al, Science 250:553-556 (1990) where an OspA- 
glutathione-5-transferase fusion protein was produced and shown to elicit protective immunity 
10 against Lyme disease in immune competent mice. 

The present invention further relates to variants of the nucleic acid molecules of the present 
invention, which encode portions, analogs or derivatives of the B. burgdorferi polypeptides 
iff shown in Table 1. Variants may occur naturally, such as a natural allelic variant. By an "allelic 
fi variant" is intended one of several alternate forms of a gene occupying a given locus on a 
tip chromosome of an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985). 
|ll Non-naturally occurring variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions or additions, 
si The substitutions, deletions or additions may involve one or more nucleotides. These variants 
]** may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions 
"20 may produce conservative or non-conservative amino acid substitutions, deletions or additions. 
Especially preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of the B. burgdorferi polypeptides disclosed herein or portions 
thereof. Also especially preferred in this regard are conservative substitutions. 

The present application is further directed to nucleic acid molecules at least 90%, 95%, 
25 96%, 97%, 98%o or 99% identical to a nucleic acid sequence shown in Table 1. The above 

nucleic acid sequences are included irrespective of whether they encode a polypeptide having B. 
burgdorferi activity. This is because even where a particular nucleic acid molecule does not 
encode a polypeptide having B. burgdorferi activity, one of skill in the art would still know how 
to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid 
30 molecules of the present invention that do not encode a polypeptide having B. burgdorferi activity 
include, inter alia, isolating an B. burgdorferi gene or allelic variants thereof from a DNA library, 
and detecting B. burgdorferi mRNA expression samples, environmental samples, suspected of 
containing B. burgdorferi by Northern Blot analysis. 

Embodiments of the invention include isolated nucleic acid molecules comprising a 
35 . polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 
95%, 96%, 97%, 98% or 99% identical to (a) a nucleotide sequence encoding any of the amino 
acid sequences of the full-length polypeptides shown in Table 1 ; (b) a nucleotide sequence 
' encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1 but 
minus the N-terminal methionine residue, if present; (c) a nucleotide sequence encoding any of the 
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amino acid sequences of the truncated polypeptides shown in Table 1 ; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b), or (c) above. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 
98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a 
polypeptide having B. burgdorferi protein activity By "a polypeptide having B. burgdorferi 
activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an 
activity of the B. burgdorferi protein of the invention, as measured in a particular biological assay 
suitable for measuring activity of the specified protein. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 
recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 
96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table 1 will encode a 
polypeptide having B. burgdorferi protein activity. In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even 
without performing the above described comparison assay. It will be further recognized in the art 
that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having B. burgdorferi protein activity. This is because the skilled 
artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly effect protein function (e.g., replacing one aliphatic amino acid with a second 
aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present invention are expected 
to be similar or identical to polypeptides from other bacteria that share a high degree of structural 
identity/similarity. Tables 2 lists accession numbers and descriptions for the closest matching 
sequences of polypeptides available through Genbank and Derwent databases. It is therefore 
expected that the biological activity or function of the polypeptides of the present invention will be 
similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in 
Table 2. 

By a polynucleotide having a nucleotide sequence at least, for example, 95 % "identical" to 
a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence 
of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence 
may include up to five point mutations per each 100 nucleotides of the reference nucleotide 
sequence encoding the B. burgdorferi polypeptide. In other words, to obtain a polynucleotide 
having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% 
(5 of 100) of the nucleotides in the reference sequence may be deleted, inserted, or substituted 
with another nucleotide. The query sequence may be an -entire sequence shown in Table 1, the 
ORP (open reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention 
can be determined conventionally using known computer programs. A preferred mexhod for 
determining the best overall match between a query sequence (a sequence of the present invention) 
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and a subject sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et al. 
(1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can be compared by first converting U's to T's. 
The result of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k- 
tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, 
not because of internal deletions, a manual correction must be made to the results. This is because 
. the FASTDB program does not account for 5' and 3' truncations of the subject sequence when 
calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the 
query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to arrive at a 
final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only nucleotides outside the 5' and 3' nucleotides of the subject sequence, as 
displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90,nucleotide subject sequence is aligned to a 100 nucleotide query 
sequence to determine percent identity. The deletion's occur at the 5' end of the subject sequence 
and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 
nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the sequence (number of 
nucleotides at the 5' and 3' ends not matched/total number of nucleotides in the query sequence) 
so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 nucleotides were perfectly matched the final percent identity would be 90%. In 
another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query 
sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5' 
or 3' of the subject sequence which are not matched/aligned with the query. In this case the 
percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 
5' and 3' of the subject sequence which are not matched/aligned with the query sequence are 
manually corrected for. No other manual corrections are to made for the purposes of the present 
invention. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated DNA molecules of 



the present invention, host cells which are genetically engineered with the recombinant vectors, 
and the production of B. burgdorferi polypeptides or fragments thereof by recombinant 
techniques. 

Recombinant constructs may be introduced into host cells using well known techniques 
such as infection, transduction, transfection, transvection, electroporation and transformation. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors 
may be replication competent or replication defective. In the latter case, viral propagation 
generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium 
phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be 
packaged in vitro using an appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising eft-acting control regions to the polynucleotide of 
interest. Appropriate trans-acting factors may be supplied by the host, supplied by a 
complementing vector or supplied by the vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for specific 
expression, which may be inducible and/or cell type-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, episomal- and 
virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast 
episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors 
derived from combinations thereof, such as cosmi ds and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, such as the 
phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late 
promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be 
known to the skilled artisan. The expression constructs will further contain sites for transcription 
initiation, termination and, in the transcribed region, a ribosome binding site for translation. The 
coding portion of the mature transcripts expressed by the constructs will preferably include a 
translation initiating site at the beginning and a termination codon (UAA, VGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable marker. 
Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture 
and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. 
Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such 
as E. coli, Strepwmyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the 




above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available 
from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, 
pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic 
vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be 
readily apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention include the E. 
coli lad and ZacZ promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR and PL 
promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs, such as-those of the Rous sarcoma virus (RSV), and metallothionein 
promoters, such as the mouse' me tall othionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et al, Basic Methods In Molecular Biology (1986). 

Transcription of DNA encoding the polypeptides of the present invention by higher 
eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are 
cw-acting elements of DNA, usually about from 10 to 300 bp that act to increase transcriptional 
activity of a promoter in a given host cell-type. Examples of enhancers include the S V40 
enhancer, which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, 
into the periplasmic space or into the extracellular environment, appropriate secretion signals may 
be incorporated into the expressed polypeptide. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may 
include not only secretion signals, but also additional heterologous functional regions. For 
instance, a region of additional amino acids, particularly charged amino acids, may be added to 
the N-terminus of the polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties may be added to 
the polypeptide to facilitate purification. Such regions may be removed prior to final preparation 
of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, are familiar and routine 
techniques in the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-0 464 533 (Canadian 
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counterpart 2045869) discloses fusion proteins comprising various portions of constant region of 
immunoglobin molecules together with another human protein or part thereof. In many cases, the 
Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 
hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein 
has been expressed, detected and purified in the advantageous manner described. This is the case 
when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the 
fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, ML5 -receptor has been fused with Fc portions for the purpose of 
high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al, J. 
Molec. Recogn. 5:52-58 (1995) and Johanson, K. et al., J. Biol. Chem. 270 (7<5j:9459-9471 
(1995). 

The B. burgdorferi polypeptides can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") 
is employed for purification. Polypeptides of the present invention include naturally purified 
products, products of chemical synthetic procedures, and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 

Polypeptides and Fragments 

The invention further provides isolated polypeptides having the amino acid sequences in 
Table 1, and peptides or polypeptides comprising portions of the above polypeptides. The terms 
"peptide" and "oligopeptide" are considered synonymous (as is commonly recognized) and each 
term can be used interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polypeptide" is used herein for chains containing 
more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences 
herein are written from left to right and in the direction from amino terminus to carboxy terminus. 

As discussed in detail below, immunization using B. burgdorferi sensu stricto isolate B31 
decorin-binding protein elicits the production of antiserum which confers passive immunity 
against Borrelia species and strains which express divergent forms of this protein. Cassatt, D. et 
al, Protection of Borrelia burgdorferi Infection by Antibodies to Decorin-binding Protein, in 
Vaccines 97, Cold Spring Harbor Press (1997), pages 191-195. Thus, some amino acid 
sequences of the B. burgdorferi polypeptides shown in Table 1 can be varied without 
significantly effecting the antigenicity of the polypeptides. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the polypeptide which 
determine antigenicity. In general, it is possible to replace residues which do not form part of an 
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antigenic epitope without significantly effecting the antigenicity of a polypeptide. 
Variant and Mutant Polypeptides 

To improve or alter the characteristics of B. burgdorferi polypeptides of the present 
invention, protein engineering may be employed. Recombinant DNA technology known to those 
5 skilled in the art can be used to create novel mutant proteins or muteins including single or 
multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified 
polypeptides can show, e.g., enhanced activity or increased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. 

10 

N-Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. 
Chem., 268:2984-2988 (1993), reported modified KGF proteins that had heparin binding 

;#5 activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the 
present invention provides polypeptides having one or more residues deleted from the amino 

ss terminus of the amino acid sequence of the B. burgdorferi polypeptides shown in Table 1, and 
polynucleotides encoding such polypeptides. 

Similarly, many examples of biologically functional C-terminal deletion muteins are 
known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 
ammo acid residues from the carboxy terminus of the protein See, e.g., Dobeli, et al. (1988) J. 
Biotechnology 7: 199-216. Accordingly, the present invention provides polypeptides having one 
or more residues from the carboxy terminus of the amino acid sequence of the B. burgdorferi 
polypeptides shown in Table 1. The invention also provides polypeptides having one or more 

25 amino acids deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions or fragments 
of the amino acid sequences described herein as well as to portions or fragments of the isolated 
amino acid sequences described herein. Fragments include portions of the amino acid sequences 
of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one 

30 of which representing a N-terminal position. The initiation codon of the polypeptides of the 

present inventions position 1. Every combination of a N-terminal and C-terminal position that a 
fragment at least 5 contiguous amino acid residues in length could occupy, on any given amino 
acid sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of residues in a 

35 full length amino acid sequence minus 1 . Therefore, included in the invention are contiguous 
fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth 
in Table 1 wherein the contiguous fragment is any integer between 5 and the number of residues 
in a full length sequence minus 1. 

Further, the invention includes polypeptides comprising fragments specified by size, in 
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amino acid residues, rather than by N-terminal and C-terminal positions. The invention includes 
any fragment size, in contiguous amino acid residues, selected from integers between 5 and the 
number of residues in a full length sequence minus 1. Preferred sizes of contiguous polypeptide 
fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino 
5 acid -residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid 
residues, about 100 amino acid residues, about 200 amino acid residues, about 300 amino acid 
residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any integer between 5 
and the number of residues in a full length sequence minus 1 are included in the invention. The 

10 present invention also provides for the exclusion of any fragments specified by N-terminal and C- 
terminal positions or by size in amino acid residues as described above. Any number of 

3 fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as 

% described above may be excluded. 

u The above fragments need not be active since they would be useful, for example, in 

~p immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion 

:: J of the protein, as vaccines, and as molecular weight markers. 

■■■<■■> Other Mutants 

0 In addition to N- and C-terminal deletion forms of the protein discussed above, it also will 

r ip be recognized by one of ordinary skill in the art that some amino acid sequences of the B. 

;;!! burgdorferi polypeptide can be varied without significant effect of the structure or function of the 

"* protein. If such differences in sequence are contemplated, it should be remembered that there will 
be critical areas on the protein which determine activity. 

Thus, the invention further includes variations of the B. burgdorferi polypeptides which 

25 show substantial B. burgdorferi' polypeptide activity or which include regions of B. burgdorferi 
protein such as the protein portions discussed below. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions selected according to general rules known in the art so 
as to have little effect on activity. For example, guidance concerning how to make phenotypically 
silent amino acid substitutions is provided. There are two main approaches for studying the 

30 tolerance of an amino acid sequence to change. See, Bowie, J. U. et al. (1990), Science 

247:1306-1310. The first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or screens to 
identify sequences that maintain functionality. 

35 These studies have revealed that proteins are surprisingly tolerant of amino acid 

substitutions. The studies indicate which amino acid changes are likely to be permissive at a 
certain position of the protein. For example, most buried amino acid residues require nonpolar 
side chains, whereas few features of surface side chains are generally conserved. Other such 
phenotypically silent substitutions are described by Bowie et ai. (supra) and the references cited 



therein. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and He; interchange of the hydroxyl residues Ser 
and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues 
Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 
residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 1, or that 
encoded by the plaimds listed in Table 1, may be: (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be one 
encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes 
a substituent group: or (iii) one in which the B. burgdorferi polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
■ . polyethylene glycol): or (iv) one in which the additional amino acids are fused to the above form 
of the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a 
sequence which is employed for purification of the above form of the polypeptide or a proprotein 
sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those 
skilled in the art from the teachings herein. 

Thus, the B. burgdorferi polypeptides of the present invention may include one or more 
amino acid substitutions, deletions, or additions, either from natural mutations or human 
manipulation. As indicated, changes are preferably of a minor nature, such as conservative amino 
acid substitutions that do not significandy affect the folding or activity of the protein (see Table 3). 

Amino acids in the B. burgdorferi proteins of the present invention that are essential for 
function can be identified by methods known in the art, such as site-directed mutagenesis or 
alanine-scanning mutagenesis. See, e.g., Cunningham et al. (1989) Science 244:1081-1085. 
The latter procedure introduces single alanine mutations at every residue in the molecule. The 
resulting mutant molecules are then tested for biological activity using assays appropriate for 
measuring the function of the particular protein. 

Of special interest are substitutions of charged amino acids with other charged or neutral 
amino acids which may produce proteins with highly desirable improved characteristics, such as 
less aggregation. Aggregation may not only reduce activity but also be problematic when 
preparing pharmaceutical formulations, because aggregates can be immunogenic. See, e.g., 
Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838- 
845; Cleland, et al., (1993) Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an isolated form, and 
preferably are substantially purified. A recombinantly produced version of the B. burgdorferi 
polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) 
Gene 67:31-40. Polypeptides of the invention also can be purified from natural or recombinant 
sources using antibodies directed against the polypeptides of the invention in methods which are 
well known in the art of protein purification. 
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The invention further provides for isolated B. burgdorferi polypeptides comprising an 
amino acid sequence selected from the group consisting of: (a) the amino acid sequence of a full- 
length B. burgdorferi polypeptide having the complete amino acid sequence shown in Table 1; 
(b) the amino acid sequence of a full-length B. burgdorferi polypeptide having the complete amino 
acid sequence shown in Table 1 excepting the N-terminal methionine; (c) the complete amino acid 
sequence encoded by the plaimds listed in Table 1; and (d) the complete amino acid sequence 
excepting the N-terminal methionine encoded by the plaimds listed in Table 1. The polypeptides 
of the present invention also include polypeptides having an amino acid sequence at least 80% 
identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% 
or 99% identical to those described in (a), (b), (c), and (d) above. 

Further polypeptides of the present invention include polypeptides which have at least 
90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 
97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which comprises the amino 
acid sequence of a B. burgdorferi polypeptide having an amino acid sequence which contains at 
least one conservative amino acid substitution, but not more than 50 conservative amino acid 
substitutions, not more than 40 conservative amino acid substitutions, not more than 30 
conservative amino acid substitutions, and not more than 20 conservative amino acid 
substitutions. Also provided are polypeptides which comprise the amino acid sequence of a B. 
burgdorferi polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
conservative amino acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to 
a query amino acid sequence of the present invention, it is intended that the amino acid sequence 
of the subject polypeptide is identical to the query sequence except that the subject polypeptide 
sequence may include up to five amino acid alterations per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 96%, 97%, 
.98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino 
acid sequence encoded by the plaimds listed in Table 1 can be determined conventionally using 
known computer programs. A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the 



24 

query and subject sequences are both amino acid sequences. The result of said giobal sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment 
are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 
Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is 
shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, the results, in percent identity, must be manually 
corrected. This is because the FASTDB program does not account for N- and C-terminal 
truncations of the subject sequence when calculating global percent identity. For subject 
sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C-terminal 
of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a 
percent of the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then subtracted 
from the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score is what is 
used for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are considered for the 
purposes of manually adjusting the percent identity score. That is, only query amino acid 
residues outside the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query 
sequence to determine percent identity. The deletion occurs at the N-terminus of the subject 
sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at 
the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at 
the N- and C- termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the remaining 
90 residues were perfectly matched the final percent identity would be 90%. In another example, 
a 90 residue subject sequence is compared with a 100 residue query sequence. This time the 
deletions are internal so there are no residues at the N- or C-termini of the subject sequence which 
are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is 
not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of 
the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 
with the query sequence are manually corrected. No other manual corrections are to made for the 
purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they have their 
normal biological activity. This is because even where a particular polypeptide molecule does not 
have biological activity, one of skill in the art would still know how to use the polypeptide, for 
instance, as a vaccine or to generate antibodies. Other uses of the polypeptides of the present 



invention that do not have B. burgdorferi activity include, inter alia, as epitope tags, in epitope 
mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel 
filtration columns using methods known to those of skill in the art. 

As described below, the polypeptides of the present invention can also be used to raise 
polyclonal and monoclonal antibodies, which are useful in assays for detecting B. burgdorferi 
protein expression or as agonists and antagonists capable of enhancing or inhibiting B. 
burgdorferi protein function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" B. burgdorferi protein binding proteins which are also candidate agonists and 
antagonists according to the present invention. See, e.g., Fields et al. (1989) Nature 
340:245-246. 

Epitope -Bearing Portions 

In another aspect, the invention- provides peptides and polypeptides comprising 
epitope-bearing portions of the B. burgdorferi polypeptides of the present invention. These 
epitopes are immunogenic or antigenic epitopes of the polypeptides of the present invention. An - 
■ "immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the 
whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be 
confined to a few loci on the molecule. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The 
number of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 
Piedicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists 
amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The 
polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not 
considered non-antigenic. This'is because they may still be antigenic in vivo but merely not 
recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues 
comprising preferred antigenic epitopes but not a complete list. Amino acid residues comprising 
other anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf analysis or 
by in vivo testing for an antigenic response using the methods described herein or those known in 
the art. 

As to the selection of peptides or polypeptides bearing an antigenic epitope {i.e., that 
contain a region of a protein molecule to which an antibody can bind), it is well known in that art 
that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable 
of eliciting an antiserum that reacts with the partially mimicked protein. See, e.g., Sutcliffe, et 
al., (1983) Science 219:660-666. Peptides capable of eliciting protein-reactive sera are frequently 
represented in the primary sequence of a protein, can be characterized by a set of simple chemical 
rules, and are confined neither to immunodominant regions of intact proteins {i.e., immunogenic 
epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective at inducing antibodies that bind to the 
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mimicked protein; longer, peptides, especially those containing proline residues, usually are 
effective. See, Sutcliffe, et al., supra, p. 661. For instance, 18 of 20 peptides designed 
according to these guidelines, containing 8-39 residues covering 75% of the sequence of the 
influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the 
HA1 protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the 
rabies glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful 
to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the 
invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors 
immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the 
native protein. See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic 
epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies 
to different peptides may be used for tracking the fate of various regions of a protein precursor 
which undergoes post-translational processing. The peptides and anti-peptide antibodies may be 
used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in 
competition assays since it has been shown that even short peptides (e.g., about 9 amino acids) 
can bind and displace the larger peptides in immunoprecipitation assays. See, e.g., Wilson, et 
al., (1984) Cell 37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography using methods 
known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention designed according 
to the above guidelines preferably contain a sequence of at least seven, more preferably at least 
nine and most preferably between about 10 to about 50 amino acids (i.e. any integer between 7 
and 50) contained within the amino acid sequence of a polypeptide of the invention. However, 
peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide 
of the invention, containing about 50 to about 100 amino acids, or any length up to and including 
the entire amino acid sequence of a polypeptide of the invention, also are considered 
epitope-bearing peptides or polypeptides of the invention and also are useful for inducing 
antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the 
epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents {i.e., the 
sequence includes relatively hydrophilic residues and highly hydrophobic sequences are 
preferably avoided); and sequences containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate 
an Borrelia-specific immune response or antibodies include portions of the amino acid sequences 
identified in Table 1. More specifically, Table 4 discloses a list of non-hmiting residues that are 
involved in the antigenicity of the epitope-bearing fragments of the present invention. Therefore, 
the present inventions provides for isolatd and purified antigenic epitope-bearing fragements of 
the polypeptides of the present invention comprising a peptide sequences of Table 4. The 
antigenic epitope-bearing fragments comprising a pep tide sequence of Table 4 preferably contain a 
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sequence of at least seven, more preferably at least nine and most preferably between about 10 to 
about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide of the present 
invention. That is, included in the present invention are antigenic polypeptides between the 
integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4. 
5 Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic 

polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length 
comprising one or more of the sequences of Table 4 are included. The antigenic epitope-bearing 
fragements may be specified by either the number of contiguous amino acid residues or by 
specific N-terminal and C-terminal positions as described above for the polypeptide fragements of 

10 the present invention, wherein the initiation codon is residue 1. Any number of, the described 
antigenic epitope-bearing fragements of the present invention may also be excluded from the 
present invention in the same manner. 

The epitope-bearing peptides and polypeptides of the invention may be produced by any 
conventional means for making peptides or polypeptides including recombinant means using 

<45 nucleic acid molecules' of the invention. For instance, an epitope-bearing amino acid sequence of 
the present invention may be fused to a larger polypeptide which acts as a carrier during 
recombinant production and purification, as well as during immunization to produce anti-peptide 

? , antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical 
synthesis. For instance, Houghten has described a simple method for synthesis of large numbers 

20 of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid 

H variants of a segment of the HA1 polypeptide which were prepared and characterized Oby 

rr ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. 
USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process 
is further described in U.S. Patent No. 4,631,211 to Houghten and coworkers (1986). In this 

25 procedure the individual resins for the solid-phase synthesis of various peptides are contained in 
separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps 
involved in solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. Sci. 
82:5131-5135 at 5134. 

30 Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies 

according to methods well known in the art. See, e.g., Sutcliffe, et al., supra;; Wilson, et al., 
, supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. Generally, animals may be 

immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of 
the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus 

35 toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such 
as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as 
rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 u.g peptide or 
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carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at 
intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by selection of 
anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of 
the selected antibodies according to methods well known in the art. 

Immunogenic epitope -bearing peptides of the invention, i.e., those parts of a protein that 
elicit an antibody response when the whole protein is the immunogen, are identified according to 
methods known in the art. For instance, Geysen, et al, supra, discloses a procedure for rapid 
concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an 
ELISA. Interaction of synthesized peptides with antibodies is then easily detected without 
removing them from the support. In this manner a peptide bearing an immunogenic epitope of a 
desired protein may be identified routinely by one of ordinary skill in the art. For instance, the 
immunologically important epitope in the coat protein of foot-and-mouth disease virus was located 
by Geysen et al. supra with a resolution of seven amino acids by synthesis of an overlapping set 
of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. 
Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn 
at every position within the epitope were synthesized, and the particular amino acids conferring 
specificity for the reaction with antibody were determined. Thus, peptide analogs of the 
epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an 
immunogenic epitope of a desired protein. 

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method 
of detecting or determining the sequence of monomers (amino acids or other compounds) which 
is a topological equivalent of the epitope {i.e., a "mimotope") which is complementary to a 
particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Patent 
No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a 
sequence of monomers which is a topographical equivalent of a ligand which is complementary to 
the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 
to Houghten, R. A. et al. (1996) discloses linear C,-C 7 -alkyl peralkylated oligopeptides and sets 
and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries 
for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor 
molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention 
also can be made routinely by these methods. The entire disclosure of each document cited in this 
section on "Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present invention and the 
epitope-bearing fragments thereof described above can be combined with parts of the constant 
domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. This has been shown, e.g., for 
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chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various 
domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. 
(EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a 
disuifide-linked dimeric structure due to the IgG part can also be more efficient in binding and 
neutralizing other molecules than a monomelic B. burgdorferi polypeptide or fragment thereof 
alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. Nucleic acids^ncoding the 
above epitopes of B. burgdorferi polypeptides can also be recombined with a gene of interest as 
an epitope tag to aid in detection and purification of the expressed polypeptide. 

Antibodies 

B. burgdorferi protein-specific antibodies for use in the present invention can be raised 
against the intact B. burgdorferi protein or an antigenic polypeptide fragment thereof, which may 
be presented together with a carrier protein, such as an albumin, to an animal system (such as 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 
include intact molecules, single chain whole antibodies, and antibody fragments. Antibody 
fragments of the present invention include Fab and F(ab')2 and other fragments including single- 
chain Fvs (scFv) and disuifide-linked Fvs (sdFv). Also included in the present invention are 
chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the 
polypeptides of the present invention. The antibodies of the present invention may be prepared by 
any of a variety of methods. For example, cells expressing a polypeptide of the present invention 
or an antigenic fragment thereof can be administered to an animal in order to induce the production 
of sera containing polyclonal antibodies. For example, a preparation of B. burgdorferi 
polypeptide or fragment thereof is prepared and purified to render it substantially free of natural 
contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

In a preferred method, the antibodies of the present invention are monoclonal antibodies or 
binding fragments thereof. Such monoclonal antibodies can be prepared using hybridoma 
technology. See, e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold 
Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL 
ANTIBODIES AND T-CELL HYBPJDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and 
F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, B. burgdorferi 
polypeptide-binding fragments, chimeric, and humanized antibodies can be produced through the 
application of recombinant DNA technology or through synthetic chemistry using methods known 
in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide antigen of the 
present invention may be produced in a two-step procedure through the use of anti-idiorypic 
antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, 
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therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance 
with this method, B. burgdorferi polypeptide-specific antibodies are used to immunize an animal, 
preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody whose ability 
to bind to the B. burgdorferi polypeptide-specific antibody can be blocked by the B. burgdorferi 
polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the B. burgdorferi 
polypeptide-specific antibody and can be used to immunize an animal to induce formation of 
further B. burgdorferi polypeptide-specific antibodies. 

Antibodies and fragements thereof of the present invention may be described by the 
portion of a polypeptide of the present invention recognized or specifically bound by the antibody. 
Antibody binding fragements of a polypeptide of the present invention may be described or 
specified in the same manner as for polypeptide fragements discussed above., i.e, by N-terminal 
and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody 
binding fragments, of a polypeptide of the present invention, specified by N-terminal and C- 
terminal positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the specifically 
bind a particuarlly discribed fragement of a polypeptide of the present invention and allows for the 
exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be described or 
specified in terms of their cross-reactivity. Antibodies and fragements that do not bind 
polypeptides of any other species of Borrelia other than B. burgdorferi are included in the present 
invention. Likewise, antibodies and fragements that bind only species of Borrelia, i.e. antibodies 
and fragements that do not bind bacteria from any genus other than Borrelia, are included in the 
present invention. 

Diagnostic Assays 

The present invention further relates to methods for assaying staphylococcal infection in 
an animal by detecting the expression of genes encoding staphylococcal polypeptides of the 
present invention. The methods comprise analyzing tissue or body fluid from the animal for 
Borrelia-specific antibodies, nucleic acids, or proteins. Analysis of nucleic acid specific to 
Borrelia is assayed by PCR or hybridization techniques using nucleic acid sequences of the 
present invention 'as either hybridization probes or primers. See, e.g., Sambrook et al. Molecular 
cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing differentiation 
among spotted fever group Rickettsiae species by analysis of restriction fragment length 
polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol. 32:589-595 
(detecting B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Borrelia has already been 
made, the present invention is useful for monitoring progression or regression of the disease state 
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whereby patients exhibiting enhanced Borrelia gene expression will experience a worse clinical 
outcome relative to patients expressing these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an animal, cell 
line, tissue culture, or other source which contains Borrelia polypeptide, mRNA, or DNA. 
Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial 
fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other biological source suspected 
of containing Borrelia polypeptides or nucleic acids. Methods for obtaining biological samples 
such as tissue are well known in the art. 

The present invention is useful for detecting diseases related to Borrelia infections in 
animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, 
rabbits and humans. Particularly preferred are humans. 

Total RNA can be isolated from a biological sample using any suitable technique such as 
the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski et 
al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Borrelia polypeptides having sufficient 
homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between 
complementary sequences are then assayed using any appropriate method. These include 
. Northern blot analysis, SI nuclease mapping, the polymerase chain reaction (PGR), reverse 
transcription in combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (1990) Cell 
63:303-312. Briefly, total RNA is prepared from a biological sample as described above. For the 
Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl 
sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred 
onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter 
is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon 
sperm, SDS, and sodium phosphate buffer. A B. burgdorferi polynucleotide sequence shown in 
Table 1 labeled according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and 
exposed to x-ray film. DNA for use as probe according to the present invention is described in 
the sections above and will preferably at least 15 nucleotides in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To 
prepare probe DNA for use in SI mapping, the sense strand of an above-described B. burgdorferi 
DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. 
The antisense DNA can then be digested using an appropriate restriction endonuclease to generate 
further DNA probes of a desired length. Such antisense probes are useful for visualizing 
protected bands corresponding to the target mRNA {i.e., mRNA encoding Borrelia polypeptides). 

Levels of mRNA encoding Borrelia polypeptides are assayed, for e.g., using the RT-PCR 
method described in Makino et al. (1990) Technique 2:295-301. By this method, the 
radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial 
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concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from 
a biological sample in a reaction mixture containing a RT primer and appropriate buffer. After 
incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, 
DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
5 of the RNA, the RT products are then subject to PCR using labeled primers. Alternatively, rather 
than labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR 
amplification can be performed in a DNA thermal cycler according to conventional techniques. 
After a suitable number of rounds to achieve amplification, the PCR reaction mixture is 
electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate 

10 bands (corresponding to the mRNA encoding the Borrelia polypeptides of the present invention) 
are quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, 
reagent and gel concentrations, and labeling methods are well known in the art. Variations on the 
RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can detect the 
nucleic acid of the present invention can be found in PCR PRIMER: A LABORATORY 

15 MANUAL (C.W. Dieffenbach et al. eds., Cold Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, may be used 
to detect polynucleotides of the present invention or Borrelia species including B. burgdorferi 
using bio chip technology. The present invention includes both high density chip arrays (>1000 

m oligonucleotides per cm 2 ) and low density chip arrays (<1000 oligonucleotides per cm 2 ). Bio 

,ao chips comprising arrays of polynucleotides of the present invention may be used to detect Borrelia 
species, including B. burgdorferi, in biological and environmental samples and to diagnose an 
animal, including humans, with an B. burgdorferi or other Borrelia infection. The bio chips of 
the present invention may comprise polynucleotide sequences of other pathogens including 
bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide 

25 sequences of the present invention, for use in rapid diffenertial pathogenic detection and 
diagnosis. The bio chips can also be used to monitor an B. burgdorferi or other Borrelia 
infections and to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response 
to drug therapy in the clinic and drug development in the laboratory. The bio chip technology 
comprising arrays of polynucleotides of the present invention may also be used to simultaneously 

30 monitor the expression of a multiplicity of genes, including those of the present invention. The 
polynucleotides used to comprise a selected array may be specified in the same manner as for the 
fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. 
Methods and particular uses of the polynucleotides of the present invention to detect Borrelia • 
species, including B. burgdorferi, using bio chip technology include those known in the art and 

35 those of: U.S. Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 
5527681, 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/95 1 1995, WO/9743447, WO/9535505, each incorporated herein in their entireties. 

Biosensors using the polynucleotides of the present invention may also be used to detect, 
diagnose, and monitor B. burgdorferi or other Borrelia species and infections thereof. 



Biosensors using the, polynucleotides of the present invention may also be used to detect particular 
polynucleotides of the present invention. Biosensors using the polynucleotides of the present 
invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, 
etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods 
and particular uses of the polynucleotides of the present invention to detect Borrelia species, 
including B. burgdorferi, using biosenors include those known in the art and those of: U.S. 
Patent Nos 5721 102, 5658732, 5631 170, and "World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

Thus, the present invention includes both bio chips and biosensors comprising 
polynucleotides of the present invention and methods of their use. 

Assaying Borrelia polypeptide levels in a biological sample can occur using any art-known 
method, such as antibody-based techniques. For example, Borrelia polypeptide expression in 
tissues can be studied with classical immunohistological methods. In these, the specific 
_ recognition is provided by the primary antibody {polyclonal or monoclonal) but the secondary 
detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a 
result, an immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Borrelia 
polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. Cell. 
Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 105:3087-3096. In this technique, 
which is based on the use of cationic solid phases, quantitation of a Borrelia polypeptide can be 
accomplished using an isolated Borrelia polypeptide as a standard. This technique can also be 
applied to body fluids. 

Other antibody-based methods useful for detecting Borrelia polypeptide gene expression 
include immunoassays, such as the ELISA and the radioimmunoassay (RIA). For example, a 
Borrelia polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and 
as an enzyme-labeled probe to detect and quantify a Borrelia polypeptide. The amount of a 
Borrelia polypeptide present in the sample can be calculated by reference to the amount present in 
a standard preparation using a linear regression computer algorithm. Such an ELISA is described 
in Iacobelli et al. ( 1 988) Breast Cancer Research and Treatment 11:1 9-30. In another ELISA 
assay, two distinct specific monoclonal antibodies can be used to detect Borrelia polypeptides in a 
body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as 
the enzyme-labeled probe. . 

The above techniques may be conducted essentially as a "one-step" or "two-step" assay. 
The "one-step" assay involves contacting the Borrelia polypeptide with immobilized antibody and, 
without washing, contacting the mixture with the labeled antibody. The "two-step" assay 
involves washing before contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to immobilize one component 
of the assay system on a support, thereby allowing other components of the system to be brought 
into contact with the component and readily removed from the sample. Variations of the above 
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and other immunological methods included in the present invention can also be found in Harlow 
et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, which 
catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is 
particularly preferred as it has good stability and its substrate (glucose) is readily available. 
Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide 
formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels 
include radioisotopes, such as iodine ( 125 I, 12! I), carbon ( 14 C), sulphur ( 35 S), tritium ( 3 H), indium 
( I12 In), and technetium ( 99m Tc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

Further suitable labels for the Borrelia polypeptide-specific antibodies of the present 
invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, 
Borrelia nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol 
phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, IU In, 125 I, 13I I, 32 P, 35 S, 14 C, 5I Cr, 
57 To, 5S Co, 59 Fe, 75 Se, !52 Eu, 90 Y, 67 Cu, 2I7 Ci, 2,1 At, 212 Pb, 47 Sc, ,09 Pd, etc. H, In is a preferred 
isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the I25 I or 
131 I-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more 
favorable gamma emission energy for imaging. See, e.g., Perkins et al. (1985) Eur. J. Nucl. 
Med. 10:296-301; Carasquillo et al. (1987) J. Nucl. Med. 28:281-287. For example, m In 
coupled to monoclonal antibodies with l-(P-isothiocyanatobenzyl)-DPTA has shown little uptake 
in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include 157 Gd, 55 Mn, I62 Dy, 52 Tr, and 

56 Fe. 

Examples of suitable fluorescent labels include an l52 Eu label, a fluorescein label, an 
isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an o-phthaldehyde label, and a fluoresc amine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and 
cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal label, an 
aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, 
a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei 
such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are provided by 




Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 
81: 1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate 
method, the dimaleimide method, the m-maleirnidobenzyl-N-hydroxy-succinirnide ester method, 
all of which methods are incorporated by reference herein. 
5 In a related aspect, the invention includes a diagnostic kit for use in screening serum 

containing antibodies specific against B. burgdorferi infection. Such a kit may include an 
isolated B. burgdorferi antigen comprising an epitope which is specifically immunoreactive with 
at least one anti-2?. burgdorferi antibody. Such a kit also includes means for detecting the 
binding of said antibody to the antigen. In specific embodiments, the kit may include a 
10 recombinantiy produced or chemically synthesized peptide or polypeptide antigen. The peptide or 
polypeptide antigen may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit includes a 
5 soiid su PP ort to which said peptide or polypeptide antigen is attached. Such a kit may also 
\U - include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the 
pf5 antibody to the B. burgdorferi antigen can be detected by binding of the reporter labeled antibody 
flj to the anti-B. burgdorferi polypeptide antibody. 

In a related aspect, the invention includes a method of detecting B. burgdorferi infection in 
a subject. This detection method includes reacting a body fluid, preferably serum, from the 
'T:f subject with an isolated B. burgdorferi antigen, and examining the antigen for the presence of 
^;20 bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to 
\l a solid support, and serum is reacted with the support. Subsequently, the support is reacted with 
a reporter-labeled anti-human antibody. The support is then examined for the presence of 
reporter-labeled antibody. 

The solid surface reagent employed in the above assays and kits is prepared by known 
25 techniques for attaching protein material to solid support material, such as polymeric beads, dip 
sticks, 96- well plates or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein , typically through a 
free amine group, to a chemically reactive group on the solid support, such as an activated 
carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 
30 conjunction with biotinylated antigen(s). 

The polypeptides and antibodies of the present invention, including fragments thereof, 
may be used to detect Borrelia species including B. burgdorferi using bio chip and biosensor 
technology. Bio chip and biosensors of the present invention may comprise the polypeptides of 
the present invention to detect antibodies, which specifically recognize Borrelia species, including 
35 B. burgdorferi. Bio chip and biosensors of the present invention may also comprise antibodies 
which specifically recognize the polypeptides of the present invention to detect Borrelia species, 
including B. burgdorferi or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be used to detect 
Borrelia species, including B. burgdorferi, in biological and environmental samples and to 
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diagnose an animal, including humans, with an B. burgdorferi or other Borrelia infection. Thus, 
the present invention includes both bio chips and biosensors comprising polypeptides or 
antibodies of the present invention and methods of their use. 

The bio chips of the present invention may further comprise polypeptide sequences of 
other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition 
to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic 
detection and diagnosis. The bio chips of the present invention may further comprise antibodies 
or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal 
polypeptide sequences, in addition to the antibodies or fragements thereof of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an B. burgdorferi or other 
Borrelia infection and to monitor the genetic changes (amio acid deletions, insertions, 
substitutions, etc.) in response to drug therapy in the clinic and drug development in the 
laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present 
invention may also be used to simultaneously monitor the expression of a multiplicity of 
polypeptides, including those of the present invention. The polypeptides used to comprise a bio 
chip or biosensor of the present invention may be specified in the same manner as for the 
fragements, i.e, by their N-terminal and C-terminal positions or length in contigious amino acid 
residue. Methods and particular uses of the polypeptides and antibodies of the present invention 
to detect Borrelia species, including B. burgdorferi, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. and World 
Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present 
invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and 
World Patent Nos. W09729366, W09612957, each incorporated herein in their entireties. 

Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify those which 
enhance or block the biological activity of the B. burgdorferi polypeptides of the present 
invention. The present invention further provides where the compounds kill or slow the growth 
of B. burgdorferi. The ability of B. burgdorferi antagonists, including B. burgdorferi ligands, to 
prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled 
artisan. See, e.g., Straden et al. (1997) J Bacteriol. 179(1):9-16. 

An agonist is a compound which increases the natural biological function or which 
functions in a manner similar to the polypeptides of the present invention, while antagonists . 
decrease or eliminate such functions. Potential antagonists include small organic molecules, 
peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby 
inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross bridge 
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formation. Antibodies against B. burgdorferi may be employed to bind to and inhibit B. 
burgdorferi activity to treat antibiotic resistance. Any of the above antagonists may be employed 
in a composition with a pharmaceutically acceptable carrier. 

Vaccines 

The present invention also provides vaccines comprising one or more polypeptides of the 
present invention. Heterogeneity in the composition of a vaccine may be provided by combining 
B. burgdorferi polypeptides of the present invention. Multi-component vaccines of this type are 
desirable because they are likely to be more effective in eliciting protective immune responses 
against multiple species and strains of the Borrelia genus than single polypeptide vaccines. Thus, 
as discussed in detail below, a multi-component vaccine of the present invention may contain one 
or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 
8, of the B. burgdorferi polypeptides shown in Table 1, or fragments thereof. 

Multi -component vaccines are known in the art to elicit antibody production to numerous 
immunogenic components. Decker, M. and Edwards, K., J. Infect. Dis. 174:S210-275 (1996). 
In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been 
demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic 
agents. Aristegui, J. et ah, Vaccine 25:7-9 (1997). 

The present invention thus also includes multi-component vaccines. These vaccines 
comprise more than one polypeptide, immunogen or antigen. An example of such a multi- 
component vaccine would be a vaccine comprising more than one of the B. burgdorferi 
polypeptides shown in Table 1. A second example is a vaccine comprising one or more, for 
example 2 to 10, of the B. burgdorferi polypeptides shown in Table 1 and one or more, for 
example 2 to 10, additional polypeptides of either borrelial or non-borrelial origin. Thus, a multi- 
component vaccine which confers protective immunity to both a borrelial infection and infection 
by another pathogenic agent is also within the scope of the invention. 

As indicated above, the vaccines of the present invention are expected to elicit a protective 
immune response against infections caused by species and strains of Borrelia other than B. 
burgdorferi sensu stricto isolate B31 (ATCC Accession No. 35210). Immunizations using 
decorin-binding protein and OspA derived from one strain of B. burgdorferi has been shown to 
elicit the production of antiserum which confers passive immunity against other strains of B. 
burgdorferi. Cassatt, D. et at, Protection of Borrelia burgdorferi Infection by Antibodies to 
Decorin-binding Protein, in Vaccines97, Cold Spring Harbor Press (1997), pages 191-195. 
Further, the inventors have found using an in vitro assay that antiserum produced in response to 
B. burgdorferi decorin-binding protein will kill several species of Borrelia. The amino acid 
sequences of decorin-binding protein expressed by different strains of B. burgdorferi are 
believed to diverge by as much as 25%. Thus, antisera elicited against decorin-binding proteins 
confers passive immunity against Borrelia expressing proteins having only 75% or less amino 
acid sequence similarity. 
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Further within the scope of the invention are whole cell and whole viral vaccines. Such 
vaccines may be produced recombinantly and involve the expression of one or more of the 
B. burgdorferi polypeptides shown in Table 1. For example, the B. burgdorferi polypeptides of 
the present invention may be either secreted or localized intracellular, on the cell surface, or in the 
periplasmic space. Further, when a recombinant virus is used, the B. burgdorferi polypeptides 
of the present invention may, for example, be localized in the viral envelope, on the surface of the 
capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing 
heterologous proteins are known in the art. See, e.g., Robinson, K. et al, Nature Biotech. 
15:653-651 (1997); Sirard, J. et al, Infect. Immun. 55:2029-2033 (1997); Chabalgoity, J. et al, 
Infect. Immun. 55:2402-2412 (1997). These cells may be administered live or may be killed 
prior to administration. Chabalgoity, J. et al, supra, for example, report the successful use in 
mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth 
fatty acid-binding protein as a fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in the art by 
combining one or more B. burgdorferi polypeptides of the present invention, or fragments 
thereof, with additional non-borrelial components {e.g., diphtheria toxin or tetanus toxin, and/or 
other compounds known to elicit an immune response). Such vaccines are useful for eliciting 
protective immune responses to both members of the Borrelia genus and non-borrelial pathogenic 
agents. 

The vaccines of the present invention also include DNA vaccines. DNA vaccines are 
currently being developed for a number of infectious diseases. Boyer, J et al, Nat. Med. 3:526- 
532 (1997); reviewed in Spier, R., Vaccine 74:1285-1288 (1996). Such DNA vaccines contain a 
nucleotide sequence encoding one or more B. burgdorferi polypeptides of the present invention 
oriented in a manner that allows for expression of the subject polypeptide. The direct 
administration of plasmid DNA~encoding OspA has been shown to elicit protective immunity in 
mice against borrelial challenge. Luke, C. et al, J. Infect. Dis. 175:91-91 (1997). 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim, J. et al, Nature 
Biotech. 25:641-646 (1997), for example, report the enhancement of immune responses produced 
by DNA immunizations when DNA sequences encoding molecules which stimulate the immune 
response are co-administered. In a similar fashion, the vaccines of the present invention may be 
co-administered with either nucleic acids encoding immune modulators or the immune modulators 
themselves. These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to borrelial 
infection by either passive or active immunization. When the vaccines of the present invention are 
used- to confer resistance to borrelial infection through active immunization, a vaccine of the 
present invention is administered to an animal to elicit a protective immune response which either 
prevents or attenuates a borrelial infection. When the vaccines of the present invention are used to 
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confer resistance to borrelial infection tiirough passive immunization, the vaccine is provided to a 
host animal {e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered 
and directly provided to a recipient suspected of having an infection caused by a member of the 
Borrelia genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules provides 
an additional method for treating borrelial infections when passive immunization is conducted. In 
this embodiment, antibodies, or fragments of antibodies, capable of recognizing the 
B. burgdorferi polypeptides disclosed herein, or fragments thereof, as well as other Borrelia 
proteins, are labeled with toxin molecules prior to their administration to the patient. When such 
toxin derivatized antibodies bind to Borrelia cells, toxin moieties will be localized to these cells 
and will cause their death. 

The present invention thus concerns and provides a means for preventing or attenuating a 
borrelial infection resulting from organisms which have antigens that are recognized and bound by 
antisera produced in response to the polypeptides of the present invention. As used herein, a 
vaccine is said to prevent or attenuate a disease if its administration to an animal results either in 
the total or partial attenuation {i.e., suppression) of a symptom or condition of the disease, or in 
the total or partial immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are 
provided in advance of any symptoms of borrelial infection. The prophylactic administration of 
the compound(s) serves to prevent or attenuate any subsequent infection. When provided 
therapeutically, the compound(s) is provided upon or after the detection of symptoms which 
indicate that an animal may be infected with a member of the Borrelia genus. The therapeutic 
administration of the compound(s) serves to attenuate any actual infection. Thus, the 
B. burgdorferi polypeptides, and fragments thereof, of the present invention may be provided 
either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after 
the initiation of an actual infection. 

The polypeptides of the invention, whether encoding a portion of a native protein or a 
functional derivative thereof, may be administered in pure form or may be coupled to a 
macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable 
proteins which may act as macromolecular carrier for enhancing the immunogenicity of the 
polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, 
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of 
the present invention to such macromolecular carriers are disclosed in Harlow et al, Antibodies: 
A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1988), the entire disclosure of which is incorporated by reference herein. 

A composition is said to be "pharmacologically acceptable" if its administration can be 
tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an 
agent is said to be administered in a "therapeutically effective amount" if the amount administered 
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is physiologically significant. An agent is physiologically significant if its presence results in a 
detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that the 
composition of a pharmacologically acceptable compound varies with the animal to which it is 
administered. For example, a vaccine intended for human use will generally not be co- 
administered with Freund's adjuvant. Further, the level of purity of the B. burgdorferi 
polypeptides of the present invention will normally be higher when administered to a human than 
when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine of the 
present invention is provided to an animal, it may be in a composition which may contain salts, 
buffers, adjuvants, or other substances which are desirable for improving the efficacy of the 
composition. Adjuvants are substances that can be used to specifically augment a specific 
immune response. These substances generally perform two functions: (1) they protect the 
antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically 
stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to the immune 
system, or presented separately, but into the same site of the animal being immunized. Adjuvants 
can be loosely divided into several groups based upon their composition. These groups include 
oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, 
A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ), silica, kaolin, and carbon), polynucleotides (for example, 
poly IC and poly AU acids), and certain natural substances (for example, wax D from 
Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or 
Bordetella pertussis, and members, of the genus Brucella. Other substances useful as adjuvants 
are the saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for 
use in the present invention include aluminum salts, such as A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , and 
A1NH 4 (S0 4 ). Examples of materials suitable for use in vaccine compositions are provided in 
Remington's Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, PA, pp. 1324- 
1341 (1980), which reference is incorporated herein by reference). 

The therapeutic compositions of the present invention can be administered parenterally by 
injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, 
or orally. The compositions may alternatively be administered intramuscularly, or intravenously. 
Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance 
antigen absorption. Liquid dosage forms for oral administration may generally comprise a 
liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes 
include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents 




commonly used in the art, such as purified water. Besides the inert diluents, such compositions 
can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 
flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis 
infection using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide- 
co-glycolide) has been shown to stimulate protective immune responses. Shahin, R. et al, 
Infect. Immun. 55:1195-1200 (1995). Similarly, orally administered encapsulated Salmonella 
typhimurium antigens have also been shown to elicit protective immunity in mice. Allaoui- 
Attarki, K. et al, Infect. Immun. (55:853-857 (1997). Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving contacting the 
vaccine with mucous membranes {e.g., intranasally, intracolonicly, intraduodenally). 

Many different techniques exist for the timing of the immunizations when a multiple 
administration regimen is utilized. It is possible to use the compositions of the invention more 
than once to increase the levels and diversities of expression of the immunoglobulin repertoire 
expressed by the immunized animal. Typically, if multiple immunizations are given, they will be 
given one to two months apart. 

According to the present invention, an "effective amount" of a therapeutic composition is 
one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to 
provide an effective amount of the composition will vary depending upon such factors as the 
animal's or human's age, condition, sex, and extent of disease, if any, and other variables which 
can be adjusted by one of ordinary skill in the art. 

The antigenic preparations of the invention can be administered by either single or multiple 
dosages of an effective amount. Effective amounts of the compositions of the invention can vary 
from 0.01-1,000 uvg/ml per dose, more preferably 0.1-500 ug/ml per dose, and most preferably 
10-300 ug/ml per dose. 

Having now generally described the invention, the same will be more readily understood 
through reference to the following example which is provided by way of illustration, and is not 
intended to be limiting of the present invention, unless specified. 

Examples 

1. Preparation of PCR Primers and Amplification of DNA 
Various fragments of the Borrelia burgdorferi genome, such as those of Table 1, can be 
used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The 
PCR primers are preferably at least 15 bases, and more preferably at least 18 bases in length. 
When selecting a primer sequence, it is preferred that the primer pairs have approximately the 
same G/C ratio, so that melting temperatures are approximately the same. The PCR primers and 
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amplified DNA of this Example find use in the Examples that follow. 

2. Isolation of a Selected DNA Clone From B. burgdorferi 

Three approaches are used to isolate a B. burgdorferi clone comprising a polynucleotide of 
5 the present invention from any B. burgdorferi genomic DNA library. The B. burgdorferi strain 
B3 1PU has been deposited as a convienent source for obtaining a B. burgdorferi strain although a 
wide varity of strains B. burgdorferi strains can be used which are known in the art. 

B. burgdorferi genomic DNA is prepared using the following method. A 20ml overnight 
bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain Heart Infusion broth 
10 or Super broth), pelleted, ished two times with TES (30mM Tris-pH 8.0, 25mM EDTA, 50mM 
NaCl), and resuspended in 5ml high salt TES (2.5M NaCl). Lysostaphin is added to final 
concentration of approx 50ug/ml and the mixture is rotated slowly 1 hour at 37C to make 
Q protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and 
warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is 
b|5 then added to lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g 
™j in 5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). A CsCl 
y gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and overlaying with 
O 2.85M CsCl. The gradient is carefully overlayed with the DNA-containing GuHCl solution. The 
t n gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume 
jl$0 is increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 ug/ml) 
: overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a 

q desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid B. burgdorferi 
genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the 
25 present invention. Particularly ,~a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. The 
oligonucleotide is labeled, for instance, with 32 P-y-ATP using T4 polynucleotide kinase and 
purified according to routine methods. {See, e.g., Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is 
30 transformed into a suitable host, as indicated above (such as XL-1 Blue (Stratagene)) using 
techniques known to those of skill in the art. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et 
al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 
1989). The transformants are plated on 1.5% agar plates (containing the appropriate selection 
35 agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. These plates 
are screened using Nylon membranes according to routine methods for bacterial colony screening. 
See, e.g., Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN 
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MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to 
those of skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a 
polynucleotide of Table 1 are synthesized and used to amplify the desired DNA by PCR using a 
5 B. burgdorferi genomic DNA prep as a template. PCR is carried out under routine conditions, for 
instance, in 25 jil of reaction mixture with 0.5 ug of the above DNA template. A convenient 
reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) gelatin, 20 jiM each of dATP, dCTP, dGTP, 
dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are 
10 performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is 

analyzed by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by subcloning and 
3 sequencing the DNA product. 

f |:| Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

M|5 synthesized and used to generate a nucleotide sequence of desired length using PCR methods 

li\ known in the art. 

ilj 

3(a). Expression and Purification Borrelia polypeptides in E. coli 

Q The bacterial expression vector pQE60 is used for bacterial expression of some of the 

|fe> polypeptide fragements of the present invention. (QIAGEN, Inc., 9259 Eton Avenue, 
Q Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains 
Q a bacterial origin of replication ("ori"), an DFTG inducible promoter, a ribosome binding site 
("RBS"), six codons encoding histidine residues that allow affinity purification using nickel- 

nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and suitable single 
25 restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA 

fragment encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 X 

His tag") covalently linked to the carboxyl terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a B. burgdorferi protein of the present 

invention is amplified from B. burgdorferi genomic DNA using PCR oligonucleotide primers 
30 which- anneal to the 5' and 3 ' sequences coding for the portions of the B. burgdorferi 

polynucleotide shown in Table 1. Additional nucleotides containing restriction sites to facilitate 

cloning in the pQE60 vector are added to the 5' and 3' sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an appropriate 

restriction site followed by nucleotides of the amino terminal coding sequence of the desired B. 
35 burgdorferi polynucleotide sequence in Table 1 . One of ordinary skill in the art would appreciate 

that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to 

amplify a DNA segment encoding any desired portion of the complete protein shorter or longer 

than the mature form. The 3' primer has a sequence containing an appropriate restriction site 
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followed by nucleotides complementary to the 3' end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified B. burgdorferi DNA fragment and the vector pQE60 are digested with 
restriction enzymes which recognize the sites in the primers and the digested DNAs are then 
ligated together. The B. burgdorferi DNA is inserted into the restricted pQE60 vector in a manner 
which places the B. burgdorferi protein coding region downstream from the IPTG-inducible 
promoter and in-frame with an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E. coli cells using standard procedures 
such as those described by Sambrook et al., supra.. E. coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing a B. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra). Transformants are identified by their ability to grow on 
LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from 
resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 jig/ml) and kanamycin (25 fig/ml). The O/N 
culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells 
are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-fj-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

The cells are then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. The cell 
debris is removed by centrifugation, and the supernatant containing the B. burgdorferi 
polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column 
(QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity 
are purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, 
QIAGEN, Inc., supra). Briefly the supernatant is loaded onto the column in 6 M guanidine-HCl, 
pH 8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the B. burgdorferi polypeptide is eluted 
with 6 M guanidine-HCl, pH 5. 

- " The purified protein is then renatured by dialyzing it against phosphate-buffered saline 
(PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein could be 
successfully refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 
mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over 




a period of 1 .5 hours or more. After renaturation the proteins can be eluted by the addition of 250 
mM immidazole. Irnmidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 4° C or frozen at -80° C. 

The polypeptide of the present invention are also prepared using a non-denaturing protein 
purification method. For these polypeptides, the cell pellet from each liter of culture is 
resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer A = 50 mM Na-phosphate, 300 
mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 with 1 tablet of Complete EDTA- 
free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). 
Absorbance at 550 nm is approximately 10-20 O.D./ml. The suspension is then put through three 
freeze/thaw cycles from -70°C (using a ethanol-dry ice bath) up to room temperature. The cells 
are lysed via sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on 
ice. The sonicated sample is then centrifuged at 15,000 RPM for 30 minutes at 4°C. The 
supernatant is passed through a column containing 1.0 ml of CL-4B resin to pre -clear the sample 
of any proteins that may bind to agarose non-specifically, and the flow-through fraction is 
collected. 

The pre-cleared flow-through is applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag bind to the Ni-NTA 
resin with high affinity and can be purified in a simple one-step procedure. Briefly, the 
supernatant is loaded onto the column in Lysis Buffer A at 4°C, the column is first washed with 
10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the 
column is washed with 5 volumes of 40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) 
(Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 
500 mM Imidazole, pH of the final buffer should be 7.5). The protein is eluted off of the column 
with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to 
Buffer B. Three different concentrations are used: 3 volumes of 75 mM Imidazole, 3 volumes of 
150 mM Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein are analyzed using 8 %, 10 % or 14% SDS-PAGE depending on the protein size. The 
purified protein is then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it 
into an easily workable buffer. The purified protein is stored at 4°C or frozen at -80°. 

The following alternative method may be used to purify B. burgdorferi expressed in E coli 
when it is present in the form of inclusion bodies. Unless otherwise specified, all of the 
following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 
cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 
Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 
amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 
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The cells are then lysed by passing the solution through a micro fluidizer (Microfuidics, 
Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCI 
solution to a final concentration of 0.5 M NaCI, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCI. 100 mM Tris, 50 mM EDTA, pH 
5 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 
B. burgdorferi polypeptide -containing supernatant is incubated at 4 C C overnight to allow further 
GuHCl extraction. 

10 Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 

GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
□ buffer containing 50 mM sodium, pH 4.5, 150 mM NaCI, 2 mM EDTA by vigorous stirring. 

The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 
f=«l purification steps. 

Ms To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 

W 

(;;;! tangential filtration unit equipped with 0. 16 (im membrane filter with appropriate surface area 

(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
i*{ is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 rnM, 1000 mM, and 
iff 0 1500 mM NaCI in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 

effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
25 Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 

acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCI. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCI, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCI, 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant A^g monitoring of the effluent. Fractions containing the B. 
30 burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 
stained 16% SDS-PAGE gel when 5 fig of purified protein is loaded. The purified protein is also 
tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml 
35 according to LAL assays. 



3(b), Alternative Expression and Purification Borrelia polypeptides in E. 
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coli 

Tthe vector pQElO is alternatively used to clone and express some of the polypeptides of 
the present invention for use in the soft tissue and systemic infection models discussed below. 
The difference being such that an inserted DNA fragment encoding a polypeptide expresses that 
polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the amino 
terminus of that polypeptide. The bacterial expression vector pQElO (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 91311) was used in this example . The components of the pQElO 
plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present 
invention expresses the polypeptide with the six His residues (i.e., a "6 X His tag")) covalently 
linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 1 were 
amplified using PCR oligonucleotide primers from genomic B. burgdorferi DNA. The PCR 
primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a 
polypeptide of the present invention. Additional nucleotides containing restriction sites to 
facilitate cloning in the pQElO vector were added to the 5' and 3' primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5 r and 3' primers were selected to 
amplify their respective nucleotide coding sequences. One of ordinary skill in the art would 
appreciate that the point in the protein coding sequence where the 5' and 3' primers begins may be 
varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present 
invention. The 5' primer was designed so the coding sequence of the 6 X His tag is aligned with 
the restriction site so as to maintain its reading frame with that of B. burgdorferi polypeptide. The 
3' was designed to include an stop codon. The amplified DNA fragment was then cloned, and the 
protein expressed, as described above for the pQE60 plasmid. 

The DNA sequences of Table 1 encoding amino acid sequences may also be cloned and 
expressed as fusion proteins by a protocol similar to that described directly above, wherein the 
pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 5371 1) is preferentially used in 
place of pQE 10. 

The above methods are not limited to the polypeptide fragements actually produced. The 
above method, like the methods below, can be used to produce either full length polypeptides or 
desired fragements therof. 

3(c). Alternative Expression and Purification of Borrelia polypeptides in 
E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this example 
(QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311).. However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His codons is prevented 
and, therefore, the polypeptide is produced with no 6 X His tag. 

The DNA sequence encoding the desired portion of the B. burgdorferi amino acid 
sequence is amplified from an B. burgdorferi genomic DNA prep the deposited DNA clones 
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using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences 
corresponding to the desired portion of the B. burgdorferi polypeptides. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' and 3' 
primer sequences. 

For cloning a B. burgdorferi polypeptides of the present invention, 5' and 3' primers are 
selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art 
would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin 
may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the 
present invention. The 3' and 5' primers contain appropriate restriction sites followed by 
nucleotides complementary to the 5' and 3' ends of the coding sequence respectively. The 3' 
primer is additionally designed to include an in-frame stop codon. 

The amplified B. burgdorferi DNA fragments and the vector pQE60 are digested with 
restriction enzymes recognizing the sites in the primers and the digested DNAs are then ligated 
together. Insertion of the B. burgdorferi DNA into the restricted pQE60 vector places the B. 
burgdorferi protein coding region including its associated stop codon downstream from the IPTG- 
inducible promoter and in-frame with an initiating AUG. The associated stop codon prevents 
translation of the six histidine codons downstream of the insertion point. 

The ligation mixture is transfonried into competent E. coli cells using standard procedures 
such as those described by Sambrook et al. E. coli strain M15/rep4, containing multiple copies of 
the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing B. burgdorferi polypeptide, is available 
commercially (Q3AGEN, Inc., supra). Transformants are identified by their ability to grow on 
LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant 
colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 jig/ml) and kanamycin (25 jig/ml). The O/N 
culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells 
are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D- 
thiogalactopyranoside ('TPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the B. burgdorferi polypeptide, the cells are then stirred for 3-4 hours at 4°C in 
6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant 
containing the B. burgdorferi polypeptide is dialyzed against 50 mM Na-acetate buffer pH 6, 
supplemented with 200 mM NaCI. Alternatively, the protein can be successfully refolded by 
dialyzing it against 500 mM NaCI, 20% glycerol, 25 mM Tris/HCl pH 7.4, containing protease 
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inhibitors. After renaturation the protein can be purified by ion exchange, hydrophobic interaction 
and size exclusion chromatography. Alternatively, an affinity chromatography step such as an 
antibody column can be used to obtain pure B. burgdorferi polypeptide. The purified protein is 
stored at 4°C or frozen at -80° C. 

The following alternative method may be used to purify B. burgdorferi polypeptides 
expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 
cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 
Sepatech). On the basis of the expected yield of protein per unit weight of ceil paste and the 
amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer (Microfuidics, 
Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 
B. burgdorferi polypeptide -containing supernatant is incubated at 4°C overnight to allow further 
GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 
The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 
purification steps. 

To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0.16 \im membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 
1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 
effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
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Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 
acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant A 2g0 monitoring of the effluent. Fractions containing the B. 
burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 
stained 16% SDS-PAGE gel when 5 jig of purified protein is loaded. The purified protein is also 
tested for endotoxin/LPS contamination, and typically the LPS content is less than 0. 1 ng/ml 
according to LAL assays. 

3(d). Cloning and Expression of B. burgdorferi in Other Bacteria 

B. burgdorferi polypeptides can also be produced in: B. burgdorferi using the methods of 
S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein Expr. Purif. 
8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. Microbiol. 
Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et al, U.S. Patent 
No. 4,952,508. 

4. Cloning and Expression in COS Cells 

A B. burgdorferi expression plasmid is made by cloning a portion of the DNA encoding a 
B. burgdorferi polypeptide into the expression vector pDNAI/Amp or pDNAIII (which can be 
obtained from Invitrogen, Inc.)r The expression vector pDNAI/amp contains: (1) an E. coli 
origin of replication effective for propagation in E. coli and other prokaryotic cells; (2) an 
ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) an SV40 origin 
of replication for propagation in eukaryotic cells; (4) a CMV promoter, apolylinker, an SV40 
intron; (5) several codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate 
purification) followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and operably linked to 
the S V40 intron and the polyadenylation signal by means of restriction sites in the polylinker. 
The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described 
by Wilson et al. 1984 Cell 37:767. The fusion of the HA tag to the target protein allows easy 
detection and recovery of the recombinant protein with an antibody that recognizes the HA 
epitope. pDNAITI contains, in addition, the selectable neomycin marker. 

A DNA fragment encoding a B. burgdorferi polypeptide is cloned into the polylinker 
region of the vector so that recombinant protein expression is directed by the CMV promoter. The 
plasmid construction strategy is as follows. The DNA from a B. burgdorferi genomic DNA prep 
is amplified using primers that contain convenient restriction sites, much as described above for 
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construction of vectors for expression of B. burgdorferi in E. coli. The 5' primer contains a 
Kozak sequence, an AUG start codon, and nucleotides of the 5' coding region of the B. 
burgdorferi polypeptide. The 3' primer, contains nucleotides complementary to the 3' coding 
sequence of the B. burgdorferi DNA, a stop codon, and a convenient restriction site. 
5 The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 

appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an 
appropriate E. coli strain such as SURE™ (Stratagene Cloning Systems, La Jolla, CA 92037), 
and the transformed culture is plated on ampicillin media plates which then are incubated to allow 
growth of ampicillin resistant colonies. Plasmid DNA is isolated from resistant colonies and 
10 examined by restriction analysis or other means for the presence of the fragment encoding the B. 
burgdorferi polypeptide 

For expression of a recombinant B. burgdorferi polypeptide, COS cells are transfected 
i with an expression vector, as described above, using DEAE-dextran, as described, for instance, 
rj§ by Sambrook et al. (supra). Cells are incubated under conditions for expression of B. 
jffs burgdorferi by the vector. 

?jj j Expression of the B. burgdorferi-HA fusion protein is detected by radiolabeling and 

Ul immunoprecipitation, using methods described in, for example Harlow et al., supra.. To this 

end, two days after transfection, the cells are labeled by incubation in media containing 35 S- 
□ cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed 
go with detergent-containing RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% 
**J DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. (supra ). Proteins are precipitated 
13 from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The 
precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
product of the expected size is seen in the cell lysate, which is not seen in negative controls. 

25 

5. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of B. burgdorferi polypeptide in this example. 
. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37 146). The 
plasmid contains the mouse DHFR gene under control of the SV40 early promoter. Chinese 

30 hamster ovary cells or other cells lacking dihydrofolate activity that are transfected with these 
plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life 
Technologies) supplemented with the chemotherapeutic agent methotrexate. The amplification of 
the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, e.g., 
Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. etBiophys. 

35 Acta, 1097:107-143; Page etal., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target enzyme, 
DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR 
gene, it is usually co-amplified and over-expressed. It is known in the ait that this approach may 
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be used to develop cell lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the 
amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse 
5 Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. (1985) Mol. Cell. Biol. 
5:438-447; plus a fragment isolated from the enhancer of the immediate early gene of human 
cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter 
are the following single restriction enzyme cleavage sites that allow the integration of the genes: 
Bam HI, Xba I, and Asp 718. Behind these cloning sites the plasmid contains the 3' intron and 
1 0 polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be 
used for the expression, e.g., the human B-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet-Off and Tet- 
On gene expression systems and similar systems can be used to express the B. burgdorferi 

g polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. 

;i5 USA 89:5547-555 1. For the polyadenylation of the mRNA other signals, e.g., from the human 

□ growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest 
J ''J integrated into the chromosomes can also be selected upon co-transfection with a selectable 

□ marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable 
:L marker in the beginning, e.g., G418 plus methotrexate. 

20 The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated 

\U using calf intestinal phosphates by procedures known in the art. The vector is then isolated from 
P !| a 1% agarose gel. The DNA sequence encoding the B. burgdorferi polypeptide is amplified using 
H PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the desired portion of 
the gene. A 5' primer containing a restriction site, a Kozak sequence, an AUG start codon, and 
25 nucleotides of the 5' coding region of the B. burgdorferi polypeptide is synthesized and used. A 
3' primer, containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the B. burgdorferi polypeptides is synthesized and used. The amplified 
fragment is digested with the restriction endonucleases and then purified again on a 1% agarose 
gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 
30 E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five 
fig of the expression plasmid pC4 is cotransfected with 0.5 jig of the plasmid pSVneo using a 
lipid-mediated transfection agent such as Lipofectin™ or LipofectAMTNE.™ (LifeTechnologies 
35 Gaithersburg, MD). The plasmid pSV2-neo contains a dominant selectable marker, the neo gene 
from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. 
The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the 
cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 




MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 
10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred to new 6-well 
5 plates containing even higher concentrations of methotrexate (1 LiM, 2 ilM, 5 jlM, 10 mM, 20 
mM). The same procedure is repeated until clones are obtained which grow at a concentration of 
100-200 [iM. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE 
and Western blot or by reversed phase HPLC analysis. 

6. Immunization and Detection of Immune Responses 

10 

6(a). B. burgdorferi propagation 

B. burgdorferi sensu stricto isolate B31 is propagated in tightly-closed containers at 34°C 
CI in modified Barbour-Stoenner-Kelly (BSKII) medium (Barbour, A.G., Yale J. Biol. Med. 
% 57:52 1 -525 (1984)) overlaid with a 5%O 2 /5%CO 2 /90%N 2 gas mixture. Cell densities of these 
13 cultures are determined by darkfield microscopy at 400X. 

iy-J Immunization of Mice and Challenge with B. burgdorferi. For active immunizations 

iiij BALB/cByJ mice (BALB, Jackson Laboratories) are injected intraperitoneally (i.p.) at week 0 
h J with 20 g of recombinant borrelial protein, or phosphate-buffered saline (PBS), emulsified with 
? fii | complete Freund's adjuvant (CFA), given a similar booster immunization in incomplete Freund's 
S<f adjuvant (UFA) at week 4, and challenged at week 6. For challenge B. burgdorferi are diluted in 
I™ BSKII from exponentially-growing cultures and mice are injected subcutaneously (s.c.) at the 
(*!! base of the tail with 0.1 ml of these dilutions (typically 10 3 -10 4 borreliae; approximately 10-100 
times the median infectious dose). Borreliae used for challenge are passaged fewer than six times 
in vitro. To assess infection, mice are sacrificed at 14-17 days post-challenge, and specimens 
25 derived from ear, bladder, and fibiotarsal joints are placed in BSKII plus 1.4% gelatin, 13 g/ml 
amphotericin B, 1.5 g/ml phosphomycin, and 15 g/ml rifampicin, and borrelia outgrowth at two 
or three weeks is quantified by darkfield microscopy. Batches of BSKH are qualified for 
infection testing by confirming that they supported the growth of 1-5 cells of isolate B31. In 
some instances seroconversion for protein P39 reactivity is also used to confirm infections (see 
30 below). Others have previously shown that mice elicited antibodies to P39 when inoculated with 
live borreliae by syringe or tick bite, but not with killed borreliae (Simpson, W.J., et al, J. Clin. 
Microbiol. 29:236-243 (1991)). 

6(b). Immunoassays 

35 - Several immunoassay formats are used to quantify levels of borrelia-specific antibodies 

(ELISA and immunoblot), and to evaluate the functional properties of these antibodies (growth 
inhibition assay). The ELISA and immunoblot assays are also used to detect and quantify 
antibodies elicited in response to borrelial infection that react with specific borrelial antigens. 
Where antibodies to certain borrelial antigens are elicited by infection this is taken as evidence that 




54 

the borrelial proteins in question are expressed in vivo. Absence of infection-derived antibodies 
(seroconversion) following borrelial challenge is evidence that infection is prevented or 
suppressed. The immunoblot assay is also used to ascertain whether antibodies raised against 
recombinant borrelial antigens recognize a protein of similar size in extracts of whole borreliae. 
5 Where the natural protein is of similar, or identical, size in the immunoblot assay to the 

recombinant version of the same protein, this is taken as evidence that the recombinant protein is 
the product of a full-length clone of the respective gene. 

Enzyme-Linked Immuno sorb ant Assay (ELISA). The ELISA is used to quantify levels of 
antibodies reactive with borrelial antigens elicited in response to immunization with these borrelial 
10 antigens. Wells of 96 well microtiter plates (Immunlon 4, Dynatech, Chantilly, Virginia, or 
equivalent) are coated with antigen by incubating 50 1 of 1 g/ml protein antigen solution in a 
suitable- buffer, typically 0.1 M sodium carbonate buffer at pH 9.6. After decanting unbound 
antigen, additional binding sites are blocked by incubating 100 1 of 3% nonfat milk in wash 
OT buffer (PBS, 0.2% Tween 20, pH 7.4). After washing, duplicate serial two-fold dilutions of sera 
;|^5 in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed 
ilj . three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After 
three washes, bound antibodies are detected with H 2 0 2 and 2,2'-azino-di-(3-ethylbenzthiazoline 
\f sulfonate) (Schwan, T.G., et al, Proc. Natl. Acad. Sci. USA 92:2909-2913 (1985)) (ABTS®, 
Kirkegaard & Perry Labs., Gaithersburg, MD) and A 405 is quantified with a Molecular Devices, 
ffp Corp. (Menlo Park, California) Vmax™ plate reader. IgG levels twice the background level in 
%J serum from naive mice are assigned the minimum titer of 1 : 100. 

M 6(c). In Vitro Growth Inhibition Assay 

Unlike other bacteria, borreliae can be killed by the binding of specific antibodies to their 

25 surface antigens. The mechanism for this in vitro killing or growth-inhibitory effect is not 

known, but can occur in the absence of serum complement, or other immune effector functions. 
Antibodies elicited in animals receiving immunizations with specific borrelial antigens that result 
in protection from borrelial challenge usually will directly kill borreliae in vitro. Thus, the in vitro 
growth inhibition assay also has a high predictive value for the protective potency of the borrelial 

30 antibodies, although exceptions, such as antibodies against OspC which are weak at in vitro 
growth inhibition, have been observed. Also, this assay can be used to evaluate the serologic 
conservation of epitope binding protective antibodies. A microwell antibody titration assay 
(Sadziene, A., et al., J. Infect. Dis. 167:165-172 (1993)) is used to evaluate the growth inhibition 
(GI) properties of antisera against recombinant borrelial antigens against the homologous B3 1 

35 isolate, and against various strains of borrelia. Briefly, 10 5 borrelia in 100 1 BSKII are added to 
serial two-fold dilutions of sera in 100 1 BSKII in 96-well plates, and the plates are covered and 
incubated at 34°C in a 5%O 2 /5%CO 2 /90%N 2 gas mixture for 72 h prior to quantification of 
borrelia growth by darkfield microscopy. 
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6(d). Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis 
(SDS-PAGE) and Immunoblotting 

Using a single well format, total borrelial protein extracts, recombinant borrelial antigen, 
or recombinant P39 samples (2 g of purified protein, or more for total borrelial extracts) are 
5 boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, 
and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. 
Gels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of 
antibody to be tested for reactivity with specific borrelial antigens, followed by the appropriate 
secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm 
10 that the protein had transferred following electro-blotting, membranes are stained with Ponceau S. 
Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using 
ECL™ reagents (Amersham Corp., Arlington Heights, Illinois). 

J:;; 6(e). Detection of Borrelia mRNA expression 

u'l5 Northern blot analysis is carried out using methods described by, among others, 

Q Sambrook et al, supra, to detect the expression of the B. burgdorferi nucleotide sequences of the 

|lf ' present invention in animal tissues. A cDNA probe containing an entire nucleotide sequence 

% shown in Table 1 is labeled with 32 P using the redzprime™ DNA labeling system (Amersham Life 

Science), according to manufacturer's instructions. After labeling, the probe is purified using a 
%:.2() CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), according to manufacturer's 
5 .~~ protocol number PT 1200-1. The purified labeled probe is then used to detect the expression of 
IJi Borrelia mRNA in an animal tissue sample. 

i3 Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using 

ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol number 
25 PT1 190- 1 . Following hybridization and washing, the blots are mounted and exposed to film at - 
70 C overnight, and films developed according to standard procedures. 

The disclosure of all publications (including patents, patent applications, journal articles, 
laboratory manuals, books,, or other documents) cited herein are hereby incorporated by reference 
in their entireties. 

30 The present invention is not to be limited in scope by the specific embodiments described 

herein, which are intended as single illustrations of individual aspects of the invention. 
Functionally equivalent methods and components are within the scope of the invention, in 
addition to those shown and described herein and will become apparant to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to 

35 fall within the scope of the appended claims. 

Provisional Application Serial No. 60/057,483 filed 3 September 1997 is incorporated by 
reference herein in its entirety. 
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TABLE 1 . Nucleotide and Amino Acid Sequences 



£101 .aa 

MSKIFLLFNAGFFFLKIIYVFSYPEIKNFSRQDPVFSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKING 
WQSGIATINLLDSPAVSVGQEILIPSKKGVFVFDSKDYRFNWLLLATRDLAKAEKVKIKSNDRVYEFYFFDFVKNP 
DFGLFSGTELLFFLNANFIFPLKKFIVSSDFGFRNDPFTGNXSFHTGIDLAAPMNAFVYLLLLE 

tlOl .aa 

SYPEIKNFSRQDPVFSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKINGWQSGIATINLLDSPAVSVGQE 
ILIPSKKGVFVFDSKDYRFNNLLLATRDLAKAEKVKIKRNDRVYEFYFFDFVKNPDFGLFSGTELLFFLNANFIFP 
LKKFIVSSDFGFRNDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 



flOl .nt 

ATGAGTAAAATTTTTTTATTATTTAATGCAGGTTTCTTTTTTTTAAAAATAATTTATGTTTTTTCTTATCCAGAAA 
TAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCTTAAAATTAAAGTTTTAAAATATAACAAAAAACAACA 
TATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAATAAAATAAATGGA 
TGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAGATTCTTATTCCCA 
GTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAACAAGGGATCTTGC 
TAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTTTGTTAAGAATCCA 
GATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCTTTAAAAAAATTTA 
TTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTTTCACTGGCAACAAAAGTTTCCATACAGGAATAGATCTTGC 
AGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 



tlOl.nt 

TCTTATCCAGAAATAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCTTAAAATTAAAGTTTTAAAATATA 
ACAAAAAACAACATATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAA 
TAAAATAAATGGATGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAG 
ATTCTTATTCCCAGTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAA 
CAAGGGATCTTGCTAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTT 
TGTTAAGAATCCAGATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCT 
i TTAAAAAAATTT ATTGTTAGTTCTGATTTTGGATTTAGAAATGAC C CTT TC ACTGGCAACAAAAGTTTC CATACAG 
GAATAGATCTTGCAGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 

fll.aa 

VKKYIKTIFLISMVYFYCCTTIKINHDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDWKINWQNTSLN 
NDKIVLKKEDLTimiETGYKNKYREFFIGPKToFKFKVYPLKIHSKNKNSNNLSSTIKYPSIFKLNITfO/GIEAKK 
TINVLITRTTKINITNK 

til .aa 

CCTTIKINHDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDWKINWQNTSLNNDKIVLKKEDLTINNET 
GYKNKYREFFIGPKTSFKFKVYPLKIHSKNKN 

£11. nt 

GTGGAAAAATTTCTTTTATTCCAGGAAATGAAAATATTGCAGATCTTGGTTTTCATAAAACTAAGTAGAAATATTG 
TCAAAAAATACATAAAAACAATATTTCTGATTTCAATGGTTTATTTTTATTGTTGTACGACAATAAAAATAAACCA 
TGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCTCTAAATACATCAATATAGATGTAATTAAAGCTACAAAT 
GAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAATAAATTGGCAAAACACTAGTCTTAACA 
ACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAA'TGAAACAGGGTATAAAAATAAATACAGAGAGTT 
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TTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAATTCATTCTAAAAACAAAAATAGCAAT 
AACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAACAAAAGTAGGAATTGAAGCAAAAAAAA 
CAATAAATGTTTTAATAACAAGAACTACAAAAATTAATATTACTAATAAATGA 



TGTTGTACGACAATAAAAATAAACCATGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCTCTAAATACATCA 
ATATAGATGTAATTAAAGCTACAAATGAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAAT 
AAATTGGCAAAACACTAGTCTTAACAACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACA 
GGGTATAAAAATAAATACAGAGAGTTTTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAA 
TTCATTCTAAAAACAAAAATAGCAATAACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAAC 
AAAAGTAGGAATTGAAGCAAAAAAAACAATAAATGTTTTAATAACAAGAACTACAAAAATTAATATTACTAATAAA 
TGA 



MREFLYRNVFKKSFIVFLIFLTFSNAIFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDV 
SKIRELLLKKFGIDPELFIKGKGLAGSGRYKIIIETADNLENFTYGLTKDESIIFEGRVNILVEDIKENKKHNIKG 
DRIVLNKNSKKLYAIGNVEYILDMDTNEKLYFYGNEFLVDFDSQNFLLKNGILQKKMQKNQIDHILSFGGKVLKKI 
DNDVTILEQAFATTSKIPEPYYSIKASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGLNPRK 
GFSVFNTWLFGNKSSSEDSSFLDFDFNSVYNSGKKPYIRNGYLTYFFAENLAPSVNXDYVKLIFDIYANLGFYSG 
IDFNLGNTLGHFKTLEGNFGLGFTRNVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLL 
SDALFSVVLEHYSDPYWIDFRDRIESATFFSLLNLDKDSVKEQTSISTFDWNLSSFYKRTFNDGSILDYKLNNLG 
LSFKLSGYENLWKSPLEKPKDWDPTRKWFYLERIYAPYIDr^FQKDLYNNQWTFPADTKEMIMRPEIKNLEDKD 
NDKKSVKEKNTKKTTELTKDLYIPPEPITLKNIDQSDSFFIRFGINPYLRNIWFFDNYGITSPKDFNYEIKNYIiFD 
IKNKTDIKIHADFYNRLITFENLLYLNTIEYSPLNKDFKVEDKDKKSEHSIINQINLNLLPFIRYPLFSRSTLKFE 
NKATLYSFNKKYDSDVKSLVNKNSSIFLSDPETFYQSLTASLIYDYDYFTTELSGELKNSFEDIKASSELKLSLDF 
PYLLQE^GIGIKYYKKFKEDAMKNSGISAVQSPLEPQKPSSPYKNLEMSPAL.YYKIEPRYLDYFKFSFLVAYDPLI 
NRVSELSFKLNVFDFQFLFAMKDDFEYNYDPLKGDFSKIGTTTKLVPYSLDSSYKKELYVLTFFDNKLSFTLGVDV 
GWKINLQKFTDNELRSALTLKFKYTEFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRK 
DSLFKIKKFSSGFKFNFYDWKFVGEYNLEPDLLRGSDGIYSPIWRt<rNFTIYISWNFFAPIKASFENNKDTNYEFII 
NRKTKK 



IFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDVSKIRELLLKKFGIDPELFIKGKGLAG 
SGRYKI I I ETADNLENFT YGLTKDES I IFEGRVNI LVEDIKENKKHNIKGDRIVLNKNSKKL YAIGNVEYILDMDT 
NE1KLYFYGNEFLVDFDSQNFLLKNGILQKKMQKNQIDHILSFC-GKVLKKIDNDVTILEQAFATTSKIPEPYYSIK 
ASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGLNPRKGFSVFNTVYLFGNKSSSEDSSFLDF 
DFNSVYNSGKKPYIRNGYLTYFFAENLAPSVNKDYVKLIFDIYANLGFYSGIDFNLGNTLGHFKTLEGNFGLGFTR 
NVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLLSDALFSWLEHYSDPYVNIDFRDRI 
ES ATFF SLLNLDKDSVKEQTS I STFDWNL SSF YKRTFNDGS I LDYKLNNLGLS FKL SGYENLYVKS PLEXPKDWD 
PTRKWFYLERIYAPYIDLNFQKDLYjSn^QWTFPADTKEMIMRPEIKNLEDKDNDKKSVKSKWTKKTTELTKDLYIPP 
EPITLKNIDQSDSFFIRFGINPYLRNNVFFDNYGITSPKDFNYEIKNYLFDIKNKTDIKIHADFYNRLITFENLLY 
LNTIEYSPLNKDFKVEDKDKKSEHSIINQINLNLLPFIRYPLFSRSTLKFENKATLYSFNKKYDSDVKSLVNKNSS 
IFLSDPETFYQSLTASLIYDYDYFTTELSGELKNSFEDIKASSELKLSLDFPYLLQEAGIGIKYYKKFKEDAMKNS 
GISAVQSPLEPQKPSSPYKNLEMSPALYYKIEPRYLDYFKFSFLVAYDPLINRVSELSFKLNA/FDFQFLFAMKDDF 
EYNYDPLKGDFSKIGTTTKLVPYSLDSSYKKELYVLTFFDNKLSFTLGVDVGWKINLQKFTDNELRSALTLKFKYT 
EFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRKDSLFKIKKFSSGFKFNFYDWKFVGE 
YNLEPDLLRGSDGIYSPIWRNNFTIYISWNFFAPIKASFENNKDTNYEFIINRKTKK 



til .: 



.nt 



fl2 .aa 



tl2 . aa 



f!2 .nt 



ATGCGAGAATTCCTATACAGGAATGTTTTTAAAAAATCTTTTATAGTATTTTTAATTTTTTTAACATTTTCTAATG 
CAATTTTTGCCCAGACTATAGATGATGAAAATTCTAAAAAAAGGGATAAGCTAACTTTAAGTCAAAAATCTTATTT 
AAGAGAACTTGAGCTTTCAACCGATGAGGATTTAAAAAAATGGGCCTTAAAAGAGGGTTTAAAAGAAACAGATGTT 
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TCAAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAAAGGGACTTGCCG 
GATCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGA 
AAGTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGCACAATATTAAAGGC 
GACAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCTTGATATGGATA 
CCAATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTTTTTATTAAAAAATGG 
TATTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATTCTTTCGTTTGGAGGAAAGGTTTTAAAAAAGATA 
GACAATGATGTTACCATTTTGGAACAAGCTTTTGCAACAACTAGTAAAATTCCAGAGCCTTACTATTCAATCAAGG 
CTTCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCATATTTTACATGGGAAGAGTTCCAGT 
ATTTTATATTCCTTTTTTTTTCAGACCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAAAA 
GGTTTTTCTGTTTTTAATACCGTTTATCTTTTTGGTAATAAATCTTCAAGTGAAGATTCTTCTTTTTTGGATTTTG 
ATTTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAA 
TTTAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGA 
ATTGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATTTTGGATTGGGTTTTACCAGGA 
ATGTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAA 
TCTTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTA 
AGTGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAG 
AAAGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGA 
TTGGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGT 
TTAAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATC 
CTACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAA 
TAACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAAAATCTAGAAGATAAAGAT 
AATGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAG 
■ AACCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAA 
TAATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTGAT 
ATAAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATC 
TTAATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTAT 
TATTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCCTTTATTTTCTAGAAGTACTTTAAAGTTTGAA 
AATAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTA 
TTTTTTTATCTGATCCGGAAACTTTTTATCAAAG'I'TTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTAC 
TGAGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTTTCTTTAGATTTT 
CCTTATTTGCTACAJVGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTG 
GAATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGC 
TTTGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATA 
AATAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTG 
AATATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACTACAACCAAACTTGTTCCATATTCTTTAGA 
TTCTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTT 
GGTTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAG 
AATTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCA 
AATTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAATTCTCAAGACAGAAAA 
GATTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAAT 
ATAATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATTTA 
TATTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTATT 
AATAGAAAAACAAAAAAATAA 

tl2 .nt 

ATTTTTGCCCAGACTATAGATGATGAAAATTCTAAAAAAAGGGATAAGCTAACTTTAAGTCAAAAATCTTATTTAA 
GAGAACTTGAGCTTTCAJVCCGATGAGGATTTAAAAAAATGGGCCTTAAAAGAGGGTTTAAAAGAAACAGATGTTTC 
AAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAAAGGGACTTGCCGGA 
TCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGAAA 
GTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGCACAATATTAAAGGCGA 
CAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCTTGATATGGATACC 
AATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTTTTTATTAAAAAATGGTA 
TTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATTCTTTCGTTTGGAGGAAAGGTTTTAAAAAAGATAGA 
CAATGATGTTACCATTTTGGAACAAGCTTTTGCAACAACTAGTAAAATTCCAGAGCCTTACTATTCAATCAAGGCT 
TCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCATATTTTACATGGGAAGAGTTCCAGTAT 
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■TTTATATTCCTTTTTTTTTCAGACCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAAAAGG 
TTTTTCTGTTTTTAATACCGTTTATCTTTTTGGTAATAAATCTTCAAGTGAAGATTCTTCTTTTTTGGATTTTGAT 
TTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAATT 
TAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGAAT 
TGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATTTTGGATTGGGTTTTACCAGGAAT 
GTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAATC 
TTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTAAG 
TGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAGAA 
AGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGATT 
GGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGTTT 
AAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATCCT 
ACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAATA 
ACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAAAATCTAGAAGATAAAGATAA 
TGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAGAA 
CCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAATA 
ATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTGATAT 
AAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATCTT 
AATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTATTA 
TTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCCTTTATTTTCTAGAAGTACTTTAAAGTTTGAAAA 
TAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTATT 
TTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTACTG 
AGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTTTCTTTAGATTTTCC 
TTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTGGA 
ATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGCTT 
TGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATAAA 
TAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTGAA 
TATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACTACAACCAAACTTGTTCCATATTCTTTAGATT 
CTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTTGG 
TTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAGAA 
TTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCAAA 
TTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAATTCTCAAGACAGAAAAGA 
TTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAATAT 
AATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATTTATA 
TTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTATTAA 
TAGAAAAACAAAAAAATAA 



MTKKLFVRVLIFLISNNYAFAKDTIKDLFFIQDILIKKEKYSEVLNNASLEGIIEIEHNGPY.IKDHDSEVKLILKE 
NGYRRNFNFFNLLNTSNIIKSLSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDDDFELKLSVTRKNWQIYLIL 
DFNFLFDQRKTFPSIYIKEEDVSTIINSFMKLQDSSFLSPQAS 



KDTIKDLFFIQDILIKKEKYSEVLNNASLEGIIEIEHNGPYIKDHDSEVKLILKENGYRRNFNFFNLLNTSNIIKS 
LSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDDDFELKLSVTRKNNQIYLILDFNFLFDQRKTFPSIYIKEED 
VSTIINSFMKLQDSSFLSPQAS 



ATGACAAAAAAATTGTTTGTGAGGGTATTAATCTTTTTAATATCCAATAATTATGCTTTTGCAAAAGACACAATCA- 
AAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAATATTCCGAGGTTCTAAATAATGCAAGCCTTGA 
AGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACTTATCCTAAAAGAA 
AACGGATATAGAAGAAATTTCAACTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGTCTAAGCTTATTTG 
ACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGACAAAAATGATTAAAGAAAATCCCTATAA 
ACGATACAAAGACGATGATGATTTTGAATTAAAACTAAGTGTAACTCGAAAAAATAATCAAATTTATTTAATTCTT 



fl29 .aa 



tl29 .aa 



fl29 .nt 
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GATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGATGTATCAACAATAA 
TAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

tl29 .nt 

AAAGACACAATCAAAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAATATTCCGAGGTTCTAAATA 
ATGCAAGCCTTGAAGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACT 
TATCCTAAAAGAAAACGGATATAGAAGAAATTTCAACTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGT 
CTAAGCTTATTTGACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGACAAAAATGATTAAAG 
AAAATCCCTATAAACGATACAAAGACGATGATGATTTTGAATTAAAACTAAGTGTAACTCGAAAAAATAATCAAAT 
TTATTTAATTCTTGATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGAT 
GTATCAACAATAATAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

fl42.aa 

^KISILYTLINIIIMLILISIVYLCKRKWSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINliVISILGDGY 
VRLLKMIIIPLIITSIISAIIKLTNSKDVGKMSLLVILTLVFTAGIAAIIGIFTALALGLTAEGLQAGTIEILQSE 
KLQKGLEILNQTTITKKITDLIPQNIFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPESIEFFKKIILTLQDI 
ILGWTLILKLTPYAILALMTKITATSEIKSIIKLGEFVIASYIAIGLTFLMKMTLIAINKLNPITFIKKIFPALS 
FAFISRSSAATIPINIEIQTKNLGVSEGIANLSSSFGTSIGQNGCAALHPAMLAIMIAPTQGINPTDISFILTLIG 
LIIITSFGAAGAGGGATTASLMVLSAMNFPVGLVGLVISVEPIIDMGRTAVNVGGSMLAGVISAKQLKQFNHNIYN 
QKELVNK 

tl42 .aa 

CKRKNVSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINWISILGDGYVRLLKMIIIPLIITSIISAIIKLTN 
SKDVGKMSLLVILTLVFTAGIAAIIGIFTALALGLTAEGLQAGTIEILQSEKLQKGLEILNQTTITKKITDLIPQN 
IFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPESIEFFKKIILTLQDIILGWTLILKLTPYAILALMTKITA 
TSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAINKLNPITFIKKIFPALSFAFISRSSAATIPINIEIQTKNLGV 
SEGIANLSSSFGTSIGQNGCAALHPAMLAIMIAPTQGINPTDISFILTLIGLIIITSFGAAGAGGGATTASLMVLS 
AMNFP VGLVGLVI SVE P 1 1 DMGRTAVNVGGSMLAGVI SAKQLKQFNHNI YNQKELVNK 



f!42 .nt 

TAAGAGGTAATAATGGATAAAATAAGTATATTATATACATTAATCAATATTATAATAATGCTTATTCTAATAAGCA 
TAGTTTATCTTTGTAAAAGAAAAAATGTTTCTTTTx^CAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATT 
TGGAATGACCATTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTG 
GGCGATGGATACGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATAA 
AACTAACCAATAGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGC 
TGCCATAATTGGCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATT 
TTACAAAGTGAAAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACAAAAAAAATCACAGATCTTA 
TTCCACAAAATATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGCTAT 
CATAGGAATAGCCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAATAATATTAACA 
CTCCAAGACATAATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAA 
AAATTACAGCAACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGG 
TCTTACATTTCTTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTC 
CCAGCACTATCATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACCCATTAATATAGAAATTCAAACTAAAA 
ATCTGGGAGTAAGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTGCAGC 
ACTACACCCCGCTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTC 
ACACTTATTGGATTAATAATAATAACTTCATTTGGAGCTGCTGGCC-CTGGTGGAGGCGCAACAACAGCCTCACTAA 
TGGTGCTCTCAGCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTATAATTGACATGGG 
AAGAACAGCTGTTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCAACCAT 
AATATATACAACCAAAAAGAGCTTGTAAACAAATAA 



tl42 .nt 
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TGTAAAAGAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATTTGGAATGACCA 
TTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTGGGCGATGGATA 
CGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATAAAACTAACCAAT 
AGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGCTGCCATAATTG 
GCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATTTTACAAAGTGA 
AAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACAAAAAAAATCACAGATCTTATTCCACAAAAT 
ATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGCTATCATAGGAATAG 
CCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAATAATATTAACACTCCAAGACAT 
AATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAAAAATTACAGCA 
ACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGGTCTTACATTTC 
TTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTCCCAGCACTATC 
ATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACCCATTAATATAGAAATTCAAACTAAAAATCTGGGAGTA 
AGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTGCAGCACTACACCCCG 
CTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTCACACTTATTGG 
ATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCGCAACAACAGCCTCACTAATGGTGCTCTCA 
GCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTA.TAATTGACATGGGAAGAACAGCTG 
TTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCAACCATAATATATACAA 
CC AAAAAGAGC TTGT AAAC AAATAA 

fl47 ,aa 

MKII.IIGGTSAGTSAAAKANRLNKKLDITIYEKTNIVSFGTCGLPYFVGGFFDNPNTMISRTQEEFEKTGISVKTN 
HEVIKVDAKNNTIVIKNQKTGTIFNNTYDQLMIATGAKPIIPPINNINLENF 

VIIGGGYIGIEMVEAAKNKRKNVRLIQLDKHILIDSFDEEIVTIMEEELTKKGVNLHTNEFVKSLIGEKKAEGWT 
NKIQ-TYQADAVILATGIKPDTEFLENQLKTTKKGAIIVNEYGETSIKNIFSAGDCATIYNIVSKKNEYIPLATTANK 
LGRIVGENLAGNHTAFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYE 
ENTKIILGAQAIGKNGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

tl47.aa 

AAAKANRLNKKLDIT I YEKTNI VS FGTCGL PYF VGGFFDNPNTMI SRTQEEFEKTG I SVKTNHEVI KVD AKNNT IV 
IKNQKTGTIFM^TYDQLMIATGAKPIIPPINNINLENFHTLIOSrLEDGQKIKKLMDREEIKNIVIIGGGYIGIEMVE 
AAKNKRKWRLIQLDKHILIDSFDEEIVTIMEEELTKKGVNLHTNEFVKSLIGEKKAEGWTNKNTYQADAVILAT 
GIKPDTEFLENQLKTTKNGAIIWEYGETSIKNIFSAGDCATIYNIVSKKjSIEYIPLATTANKLGRIVGENLAGNHT 
AFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYEENTKIILGAQAIGK 
NGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

fl47 .nt 

ATGAAAATAATAATTATTGGGGGCACATCAGCAGGAACTAGTGCCGCAGCTAAAGCAAACCGCTTAAACAAAAAGC 
TAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTGGAACCTGTGGCCTGCCTTACTTTGTGGGGGGATT 
CTTTGACAACCCCAATACAATGATCTCAAGAACACAAGAAGAATTCGAAAAAACTGGAATCTCTGTTAAAACTAAC 
CACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTAATAAAAAATCAAAAAACAGGAACCATTTTTAACA 
ATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTATTATTCCACCAATCAATAATATCAATCTAGAAAA 
TTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAAAAAATTAATGGATAGAGAAGAGATTAAAAATATA 
GTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAAGCAGCAAAAAATAAAAGAAAAAATGTAAGATTAA 
TTCAACTAGATAAGCACATACTCATAGATTGCTTTGACGAAGAAATAGTCACAATAATGGAAGAAGAACTAACAAA 
AAAGGGGGTTAATCTTCATACAAJ^TGAGTTTGTAAAAAGTTTAATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACA 
AACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACCGGAATAAAACCTGACACTGAATTTTTAGAAAACC 
AGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGTATGGCGAAACTAGCATAAAAAATATTTTTTCTGC 
AGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAATGAATACATACCCTTGGCAACAACAGCCAACAAA 
CTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACAGCATTTAAAGGCACATTGGGCTCAGCTTCAATTA 
AAATAC TATCTTT AGAAGC TGC AAGAAC AGGACTTAC AGAAAAAGATGC AAAAAAGC TC C AAATAAAAT AT AAAAC 
GATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGGCCAAGAAGATCTTTATATTAAATTAATTTATGAG 
GAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAAAATGGAGCCGTAATAAGAATTCATGCTTTATCAA 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

TTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGATGGATTTCTCATATTCCCCACCCTTCTCAAGAAC 
TTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 



GCCGCAGCTAAAGCAAACCGCTTAAACAAAAAGCTAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTG 
GAACCTGTGGCCTGCCTTACTTTGTGGGGGGATTCTTTGACAACCCCAATACAATGATCTCAAGAACACAAGAAGA 
ATTCGAAAAAACTGGAATCTCTGTTAAAACTAACCACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTA 
ATAAAAAATCAAAAAACAGGAACCATTTTTAACAATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTA 
TTATTCCACCAATCAATAATATCAATCTAGAAAATTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAA 
AAAATTAATGGATAGAGAAGAGATTAAAAATATAGTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAA 
GCAGCAAAAAATAAAAGAAAAAATGTAAGATTAATTCAACTAGATAAGCACATACTCATAGATTCCTTTGACGAAG 
AAATAGTCACAATAATGGAAGAAGAACTAACAAAAAAGGGGGTTAATCTTCATACAAATGAGTTTGTAAAAAGTTT 
AATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACAAACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACC 
GGAATAAAACCTGACACTGAATTTTTAGAAAACCAGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGT 
ATGGCGAAACTAGCATAAAAAATATTTTTTCTGCAGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAA 
TGAATACATACCCTTGGCAACAACAGCCAACAAACTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACA 
GCATTTAAAGGCACATTGGGCTCAGCTTCAATTAAAATACTATCTTTAGAAGCTGCAAGAACAGGACTTACAGAAA 
AAGATGCAAAAAAGCTCCAAATAAAATATAAAACGATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGG 
CCAAGAAGATCTTTATATTAAATTAATTTATGAGGAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAA 
AATGGAGCCGTAATAAGAATTCATGCTTTATCAATTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGA 
TGGATTTCTCATATTCCCCACCCTTCTCAAGAACTTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 



MLKFEFSDRFLLFSYFVLIMFIGSLLLMLPISWEGDGKLAYIDALFTAVSAVSITGLTTVKMEGFSTFGFILIMLL 
IQLGGLGFISITTFYLLIPKKKMNLTDARIIKQYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVNI 
SFLRALFTTISAFCNAGFSMHSESIYAWRDVPEAIVWSILII 

FFLIIIGAILFF FTEMHKLKAGYSMS TLIFNSIFYSIS TRTAGFNYLDNS L I SGRTQ IISLPFMF IGGAPGS TAGG 
IKITTFFLIVLAWKNQNGNGYIIGSYKVSIDSIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFS 
AFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRIGLFSMAVFVSRKSRFEEFTRPRQDILVG 



WEGDGKLAYIDALFTAVSAVSITGLTTA/KMEGFSTFGFILIMLLIQLGGLGFISITTFYLLIPKKKMNLTDARIIK 
QYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVNISFLEALFTTISAFCNAGFSMHSESIYAWRDVP 
EAIVWSILIICGGLGFMVYRDVl^TIKNK^^ 

IFYSISTRTAGFNYLDNSLISGRTQIISLPFMFIGGAPGSTAGGIKITTFFLIVLAWKNQNGNC-YIIGSYKVSID 
SIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFSAFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRI 
GLFSMAVFVSRKSRFEEFTRPRQDILVG 



ATGTTGAAATTTGAATTTAGCGACAGGTTTTTACTTTTTAGTTATTTTGTTTTAATTATGTTTATAGGCTCTCTTT 
TGTTGATGTTGCCTATTTCCTGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGC 
TGTAAGTATTACGGGCCTTACAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTTGATAATGTTGCTA 
ATCCAGCTTGGGGGACTTGGATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAAAATGAATTTAA 
CAGATGCAAGAATAATAAAGCAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATT 
GTTTATAACTTTTTCAATTGAAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATT 
TCATTCTTAGAGGCTTTGTTTACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTT 
ATGCATGGCGAGATGTT-CCTGAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGT 
CTATAGAGATGTAAATAACACTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGC 
TTCTTTTTAATTATAATTGGTGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGA 
GCACTTTAATATTTAATTCAATT'TTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCTTT 
AATAAGCGGAAGAACTCAAATAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATCAACTGCAGGAGGG 
ATTAAGATTACAACATTTTTTTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATATTATTGGTT 



tl47 .: 



£152 .aa 



tl52 .aa 



fl52 .nt 




# 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

CTTACAAGGTTTCAATAGATAGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTC 
TTTTTTCATGCTTCTTTTTTTTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATGAAGTATTTTCT 
GCTTTTGGAACGGTTGGTCTTTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTA 
CTATGTTTGCAGGACGAATAGGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTAC 
AAGGCCAAGGCAAGATATTTTGGTTGGTTGA 



TGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGCTGTAAGTATTACGGGCCTTA 
CAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTTGATAATGTTGCTAATCCAGCTTGGGGGACTTGG 
ATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAAAATGAATTTAACAGATGCAAGAATAATAAAG 
CAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATTGTTTATAACTTTTTCAATTG 
AAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATTTCATTCTTAGAGGCTTTGTT 
TACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTTATGCATGGCGAGATGTTCCT 
GAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGTGTATAGAGATGTAAATAACA 
CTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGCTTCTTTTTAATTATAATTGG 
TGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGAGCACTTTAATATTTAATTCA 
ATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCTTTAATAAGCGGAAGAACTCAAA 
TAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATCAACTGCAGGA.GGGATTAAGATTACAACATTTTT 
TTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATATTATTGGTTCTTACAAGGTTTCAATAGAT 
AGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTCTTTTTTCATGCTTCTTTTTT 
TTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATGAAGTATTTTCTGCTTTTGGAACGGTTGGTCT 
TTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTACTATGTTTGCAGGACGAATA 
GGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTACAAGGCCAAGGCAAGATATTT 
TGGTTGGTTGA 



MKINKTFILLFLFTKFSFVQAQANQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNK 

KYITDKVKRRDFKIRLDKNSLYAIIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVLNLT 

AKLKKYVLRIYKSNIYIAYQLENSDDIKVAEFIEDVGWFNLDSSVNRNITNIVNFDFSINSKGNLYIAFVTKSGAD 

FASELIVKKFNSRKWIDISPGHIENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSK 

GDSNVNSSNIGLISEPFLGIFYNYKSM^EIKSEFIVNNENAWVNANIPSVYMANFIKGFFDSNFNQIIMSFVSEm 

PIVNICPLKSSRWINISPlWEMEGLSADIGLYHNnSILFLA 

YGNQGLVISTLSSNSNELFFTLICQ 



NQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNKKYITDKVKRRDFKIRLDICMSLYA 

IIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVLNLTAKLKKYVLRIYKSNIYIAYQLEN 

SDDIKVAEFIEDVGWFNLDSSVNRNITNIWFDFSINSKGNLYIAFVTKSGADFASELIVKKFNSRKWIDISPGKI 

ENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSKGDSNVNSSNIGLISEPFLGIFYN 

YKSNNEIKSEFIVHNENAWVNANIPSVY]^^ 

GLSADIGLYKNMLFLAFEDNNNWLIYFIOTKWW^ 

CQ 



ATGAAAATAAATAAGACATTCATTTTGCTATTTTTATTTACAAAATTTTCTTTTGTTCAAGCTCAAGCAAATCAAA 
TATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAGTTAGCAAATC 
TTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATTTCTAATAAA 
AAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCAATAATAT 
ATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATTTTTCAATTTTAAGTAGCGATTCCCT 
GAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTGTTTTAAATTTAACT 



tl52 .nt 



fl54.aa 



tl54. 



fl54.nt 
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TABLE 1. Nucleotide and Amino Acid Sequences 



GCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAATAGCGATG 
ATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGAAATATTACTAA 
TATAGTTAATTTTGATTTTTCAATTAATTCTAAAGGAAATTTATATATTGCTTTTGTTACGAAATCAGGGGCTGAT 
TTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATAGAAAATT 
TTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTAGGGGTGA 
ATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTTAAGTAAA 
GGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAATTATAAGT 
CAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTAAATGCAAATATTCCTTCTGTTTA 
TATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGAAAATAGA 
CCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGGAAGGTTTAA 
GTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAATTTATTT 
TAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGATTGGAATT 
TATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATTTGCCAAT 
GA 

tl54 .nt 



AATCAAATATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAGTTA 
GCAAATCTTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATTTC 
TAATAAAAAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCA 
ATAATATATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATTTTTCAATTTTAAGTAGCG 
ATTCCCTGAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTGTTTTAAA 
TTTAACTGCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAAT 
AGCGATGATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGAAATA 
TTACTAATATAGTTAATTTTGATTTTTCAATTAATTCTAAAGGAAATTTATATATTGCTTTTGTTACGAAATCAGG 
GGCTGATTTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATA 
GAAAATTTTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTA 
GGGGTGAATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTT 
AAGTAAAGGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAAT 
TATAAGTCAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTAAATGCAAATATTCCTT 
CTGTTTATATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGA 
AAATAGACCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGGAA 
GGTTTAAGTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAA 
TTTATTTTAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGAT 
TGGAATTTATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATT 
TGCCAATGA 



f 157 .aa 



MKIFLKVIGRGILGRLMVFRKNYDYLALISLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWVIIGFFLIFIVG 
KYDLKFVYSMVYPLYFLLILALIFTAFFGMTVNGARSWIGIWKLGGQPSEFGKWIILTLSKFYTEKKGYNEFFTF 
ITAFLLIFPSVILILLQPDFGTAIVYLTIFIFISFFAGIDLHYVLAFALIGFFSFVFAILPVWYEYKVNMGWFYL 
IFSNPFYFRVIMGVLLLILLISVLGFFISKYGLSIKIIYFYVFFASSILLVSIVFSKVLSKLMKTYQIKRFLVFLD 
PAIDAKGAGWNLNQVKIAIGSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFL 
IIMNKSQDRYMALVISGILGLLFFHTSFNVGMSLGVLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



tl57 .aa 



RKNYDYLALISLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWVIIGFFLIFIVGKYDLKFVYSMVYPLYFLLI 
LALIFTAFFGMTVNGARSWIGIWKLGGQPSEFGKVVIILTLSKFYTEKKGYNEFFTFITAFLLIFPSVILILLQPD 
FGTAIVYLTIFIFISFFAGIDLHYVXAFALIGFFSFVFAILPVWYEYKVNMGNVFYLIFSNPFYFRVIMGVLLLIL 
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LISVLGFFISKYGLSIKIIYFYVFFASSILLVSIVFSKVLSKLMKTYQIKRFLVFLDPAIDAKGAGWNLNQVKIAI 
GSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFLIIMNKSQDRYMALVISGIL 
GLLFFHTSFNVGMSLGVLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



f 157 .nt 



ATGAAGATATTCTTAAAGGTTATAGGCCGTGGTATATTAGGTAGATTAATGGTTTTTAGAAAAAATTATGATTATT 
TGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTAGCGATTATAATATTAGTGG 
ATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTTTCTAATTTTTATAGTGGGC 
AAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATATTGGCTTTAATTTTTACTG 
CATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTGGAGGACAGCCTTCTGAATT 
TGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTATAATGAATTTTTTACCTTT 
ATTACTGCATTTTTATTAATTTTTCCATCGGTAATTCTTATATTATTGCAACCTGATTTTGGTACAGCAATAGTAT 
ATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTTTAGCATTTGCGTTGATAGG 
GTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATATGGGTAATGTATTTTATCTT 
ATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTTAATTCTTTTGATTTCTGTTTTAGGAT 
TTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTGCAAGTTCTATTTTATTAGT 
TTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATCAGATTAAACGGTTTTTGGTATTCTTAGAT 
CCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAGGTTAAAATAGCAATTGGTTCTGGCGGTCTTTTGG 
GCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCACAGATTTTATTTTTTCTAT 
TCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTTCCTTTTTTTTAAATTTTTG 
ATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTGGGACTTTTATTTTTTCATA 
CTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTCCTTTTCTCTCTTATGGAGG 
TTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAATAGTTGCTATGGATTGA 



tl57 .nt 



AGAAAAAATTATGATTATTTGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTA 
GCGATTATAATATTAGTGGATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTT 
TCTAATTTTTATAGTGGGCAAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATA 
TTGGCTTTAATTTTTACTGCATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTG 
GAGGACAGCCTTCTGAATTTGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTA 
TAATGAATTTTTTACCTTTATTACTGCATTTTTATTAATTTTTCCATCGGTAATTCTTATATTATTGCAACCTGAT 
TTTGGTACAGCAATAGTATATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTT 
TAGCATTTGCGTTGATAGGGTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATAT 
GGGTAATGTATTTTATCTTATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTTAATTCTT 
TTGATTTCTGTTTTAGGATTTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTG 
CAAGTTCTATTTTATTAGTTTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATCAGATTAAACG 
■GTTTTTGGTATTCTTAGATCCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAGGT'TAAAATAGCAATT 
GGTTCTGGCGGTCTTTTGGGCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCA 
CAGATTTTATTTTTTCTATTCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTT 
CCTTTTTTTTAAATTTTTGATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTG 
GGACTTTTATTTTTTCATACTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTC 
CTTTTCTCTCTTATGGAGGTTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAAT 
AGTTGCTATGGATTGA 



£17. aa 



MIVFLFFSIYLIILFKRSSNSPLYFVPDTKFETLSIRIVLSCSLLLIFFCTMLDARPSTIAVFPTPGSPISIALFL 
FLLKSIFVRVLISASLPTKGSNFLAFASAVKFLTYFPISKCSFSSRISSSNSL 



TABLE 1. Nucleotide and Amino Acid Sequences 



tl7 . aa 



PLYFVPDTKFETLSIRIVLSCSLLL1FFCTMLDARPSTIAVFPTPGSPISIALFLFLLKSIFVRVLISASLPTKGS 
NFLAFASAVKFLTYFPISKCSFSSRISSSNSL 



f 17 .rit 



ATGATTGTGTTTTTGTTTTTTTCAATATACTTAATTATATTATTTAAACGATCTTCAAACTCGCCTCTATATTTTG 
TTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTATTTTTTTTTGCAC 
TATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCATTGCACTATTTTTA 
TTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTCTTCCAACCAAGGGGTCTAATTTTTTGGCTT 
TTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTATTTCTTCATCAAA 
TTCTTTGTAG 



tl7 .nt 



CCTGTATATTTTGTTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTA 
TTTTTTTTTGCACTATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCAT 
TGCACTATTTTTATTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTCTTCCAACCAAGGGGTCT 
AATTTTTTGGCTTTTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTA 
TTTCTTCATCAAATTCTTTGTAG 



fl70 .aa 



MKAFKVKNLRRFSNFIRILVIVLFLNSLLSLFVFLAGSYNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLK 
NKKIKYYLILIFSFIIFFIALVFKIFLSGNK 



tl70 .aa 



YNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLKNKKIKYYLILIFSFIIFFIALVFKIFLSGNK 



fl70.nt 



ATGAAAGCTTTTAAAGTAAAAAATCTAAGACGTTTTTCAAATTTTATTAGAATTTTGGTTATTGTATTGTTTTTAA 

ATTCTTTGTTAAGTTTGTTCGTGTTTTTGGCTGGTTCTTACAATATTTTTGTTTACAATTTTCAGAAATTTTATCT 

TGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGACTTGAATCTACTAGACTGATATTTTTTTATTTTTTGAAA 

AATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAGTTTTATAATTTTTTTTATTGCTCTTGTTTTTAAAATTT 

TTCTTTCTGGTAATAA 

ATAG 



t!70 .nt 
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TACAA~A~rTCTC-rT~ACAArCTCCAGAAAr'T , rTATCTTGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGAC 
TTGAArCTACCA.GACCGA.rA7C'TTrCCATTCTTTGA^J\AATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAG 
TT'rTA-rAJ^ZTTTrTCCAJTT'GCTCTTGZCT^TAJ^AATTTTTCTTTCTGGTAATAAATAG 



JC<;iiYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTL 
^-'r/NAWLNS P SHKEAL INTDTDKIGGYRLKTTDNID I FWLFGKRK 



Fc3ZZA-3GZZZ3Z?"Z;TAZiVlZ^3?3HXZZA^ 



ATGAJ\A_AAAZZC4AZZAZAAZZZT?ACA-CTGZZ''ZTTATCTCAAGCATGCAATTTAAGTACAATGCATAAAATAGATA 
CAJAAAjGAZAGAZAZTGA^JITZCZA.ZAZZ^^ 

AGATGA-TACCCZ , Z , GAJJLAAGZCGCAAAJ i -GAATATGCCATTAAACTGGGAGAAAATAGAACAATAACTCACACCCTT 
TTZGGCACAACCCCAAZGCAAA-GAArACA.ZAAA.TACGATCAATCCTTTAATTTAACAAGAGAAATACTGGCATCAG 
GAATTGA_ACTZAJAZAGAGTAGZTAAIGCA.ZGGCTTAATAGTCCAAGCCACAAAGAAGCTCTTATTAATACAGATAC 
CGATAAJLAZA.GGZGGCZA.TAGA.TZAAA-AA.CGA.CTGACAATATAGATATATTTGTAGTTCTTTTTGGAAAAAGAAAA 

tazaagaazzga 



ACAATGCAZAZAA_AZA.GAjTACA^_AAGA_AGAZATGAAAATTCTATATTCAGAAATT 

ATCTA_AACCAJZCZAGAZAATAGA_TGATACCCTTGAA^ 

AJ^CAA-ZAA-CTCACACCCTTZC~GGCACAJVCCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACA 

agagazaazajctggcazcag:laa-tt^^ 

CTCTZATTAAZACAGA-ZACZGATAJAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGT 
TCTTTZ , TGGAJLAAAGA_AAA.ZA.ZA-AGA_ATTGA 



MKZKARMZ LLVZ ZZZAF F I 3 ZZ-FFAFG^LINSKLVDQQFNLMINLIES IKSSFNLYISSMEEKVRVSSMYFNSAEK 

FtZZA.SKIXSXFZlZFZSOQSZZZIQZGSNl^^ 

IADPG3IZZSZZ0:<ZISKZKi72CKGQZ?YZLZGM^ 

TTGRZZ.VZ^-ZTVLZGZZLZZZSAZ^YSNZIKXTSSDL^ 

RTS£Z3FYY^IS?AZ:TZZZAZ:SFVF7Lli<[I^IATLY 

YZGLE QLP.7NT S 3YAX.3VZ ZXZZDYZ YZNAZ Q IANAS QNLS SGAVEQAS TLEQMTANI EQI SQGVS ENTENAATTSK 
IA 7NZZNE?.-r:-CEG:--Z ; :3 T "/KAZ ZAMT/ITEXZ GI I DEI TRQTNLLALNAS I EAARVGEKGKGF EWAAE VRKLADQ SK 
SSAREZ IZ IAZ\?.SZT^ZAS?AGZNFZQI V?GMEQTARLVKNI SNES YKQSVQ IEQFKNAI EQVSQLVQTTAS SSEEL 
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tl96 .aa 



MLINSKLVDQQFNLMINLIESIKSSFNLYISSMEEKVRVSSMYFNSAEKFNEASKIKSKRLSFISDQSEILIQTGS 
NMMVTDKEGKIVFTTAVKDNSDFGKSIGDREYFTKLKESNSIVYNSFVMLADPGSIEESLLKDISKIKNKKGQIPY 
ILIGMPLRDFETDNIFGYFMFLYSMDYIYRSFRGINFGILSSGRALAYDTTGRLLVHHWLPGDILTDISASYSNI 
IKKTSEDLLQKNKEISTVYYYDPKSNKKYVGISQKVLLNLSNNKFILLMRTSEDDFYYMSRATTIILAISFVFTLL 
MLAIATLYLVKKLSSSLNKILEYSERLASGNFTADINFGKWDTVELYSLYEGLEQLRTNFSSVAKGVIENLDYLYE 
NAIQIANASQNLSSGAVEQASTLEQMTANIEQISQGVSENTENAATTEKIAVNTNERTKEGHKSWKAIEAMTVIT 
EKIGriDEITRQTNLLALNASIEAARVGEKGKGFEWAAEVRKLADQSKESAREIIDIANRSLTVASRAGENFEQI 
VPGMEQTARLVKNISNESYKQSVQIEQFKNAIEQVSQLVQTTASSSEELSAMSEKMLESVKDLKESVDYFKIEK 



f 196 .nt 

ATGAAGCTTAAAGCTAGGATGTTGCTACTTGTTCTTATTCTGATAGCATTCTTTATATCAATTTTGTTTTTTGCTT 
TTGGAATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAG 
TTCTTTTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAA 
TTTAATGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCG 
GTAGTAATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGG 
CAAATCTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATG 
TTGGCAGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTeCAAGATAAAAAATAAAAAAGGTCAGATTC 
CTTACATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTC 
AATGGATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATGAT 
ACTACGGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTGATATTTTGACTGATATTAGTGCTTCTTATTCCA 
ATATTATTAAGAAAACATCTGAAGATTTGTTGCAAAAGAATAAAGAAATTTCAACTGTTTATTATTATGATCCTAA 
AAGCAATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTCTAATAATAAATTTATTCTTTTAATG 
AGAACTTCAGAGGACGATTTTTATTACATGTCACGAGCTACAACTATAATCTTAGCAATTAGTTTTGTATTTACAT 
TACTTATGCTTGCTATTGCAACTCTTTATCTTGTGAAAAAGTTAAGCTCTTCTTTGAATAAGATACTGGAATATTC 
TGAGAGACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCAAATGGGATACTGTAGAGCTTTACAGTTTG 
TACGAAGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTTATTGAAAATCTAGATTATCTTT 
ATGAAAATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTGAGCAGGCTTCTACTTTAGA 
GCAAATGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAA 
ATTGCTGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTTAAGGCTATTGAGGCAATGACTGTAA 
TTACTGAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGCTTGCTTTAAATGCCTCGATTGAAGC 
TGCACGAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAGCTTGCAGATCAAAGCAAA 
GAATCAGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAAGTCGTGCTGGGGAAAATTTTGAAC 
AAATAGTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAGTTATAAGCAAAGTGTTCA 
AATAGAGCAATTTAAAAATGCAATAGAGCAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGAGCTT 
TCTGCAATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTGATTATTTTAAGATCGAAAAGT 
AA 

tl96 .nt 



ATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAGTTCTT 
TTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAATTTAA 
TGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCGGTAGT 
AATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGGCAAAT 
CTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATGTTGGC 
AGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAGATAAAAAATAAAAAAGGTCAGATTCCTTAC 
ATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTCAATGG 
ATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATGATACTAC 
GGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGG'TGATATTTTGACTGATATTAGTGCTTCTTATTCCAATATT 
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ATTAAGAAAACATCTGAAGATTTGTTGCAAAAGAATAAAGAAATTTCAACTGTTTATTATTATGATCCTAAAAGCA 
ATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTCTAATAATAAATTTATTCTTTTAATGAGAAC 
TTCAGAGGACGATTTTTATTACATGTCACGAGCTACAACTATAATCTTAGCAATTAGTTTTGTATTTACATTACTT 
ATGCTTGCTATTGCAACTCTTTATCTTGTGAAAAAGTTAAGCTCTTCTTTGAATAAGATACTGGAATATTCTGAGA 
GACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCAAATGGGATACTGTAGAGCTTTACAGTTTGTACGA 
AGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTTATTGAAAATCTAGATTATCTTTATGAA 
AATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTGAGCAGGCTTCTACTTTAGAGCAAA 
TGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAAATTGC 
TGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTTAAGGCTATTGAGGCAATGACTGTAATTACT 
GAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGCTTGCTTTAAATGCCTCGATTGAAGCTGCAC 
GAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAGCTTGCAGATCAAAGCAAAGAATC 
AGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAAGTCGTGCTGGGGAAAATTTTGAACAAATA 
GTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAGTTATAAGCAAAGTGTTCAAATAG 
AGCAATTTAAAAATGCAATAGAGCAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGAGCTTTCTGC 
AATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTGATTATTTTAAGATCGAAAAGTAA 



MRFIIAFLMILNQGFSNLFSLPPEDIIFESSYEVAIKKAQKLNKNVLILVGRDIKENLIKDFLNSFTNGEIIHKVS 
RKSVFL VIDKDNEI FNKINLQKS PTI FFVDSKNEQ IKAAYVGAVLS SVQFDKDFLNYWGAIKSTS VLKKQKDYEI 
NTADERTFFYKTLKGDWRLKFNGKDRKLVLFDTDLKEFLVFKDINENKLYAIPKSRIGNIYFSLLGNEEWKLFGKI 
K 



ATGAGATTTATAATTGCATTTTTAATGATTTTAAATCAAGGATTTTCAAATTTGTTTTCTTTGCCTCCGGAAGATA 
TTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATGTTTTAATTTTGGTTGG 
TAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGAAATTATTCACAAAGTATCT 
AGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTTTAATAAAATTAATCTACAAAAAAGTCCGACTA 
TTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTATGTGGGAGCTGTTTTGAGCAGTGTTCAATTTGA 
TAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCAAAAAGATTATGAAATT 
AATACTGCTGATGAGAGAACCTTTTTTTACAAAACATTAAAAGGTGATTGGCGATTAAAGTTTAATGGTAAAGACA 
GAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATGAAAACAAGCTTTATGC 
TATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATTGGGAAATGAAGAATGGAAGCTTTTTGGAAAAATA 
AAATAA 



TTGCCTCCGGAAGATATTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATG 
TTTTAATTTTGGTTGGTAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGAAAT 
TATTCACAAAGTATCTAGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTTTAATAAAATTAATCTA 
CAAAAAAGTCCGACTATTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTATGTGGGAGCTGTTTTGA 
GCAGTGTTCAATTTGATAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCA 
AAAAGATTATGAAATTAATACTGCTGATGAGAGAACCTTTTTTTACAAAACATTAAAAGGTGATTGGCGATTAAAG 
TTTAATGGTAAAGACAGAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATG 
AAAACAAGCTTTATGCTATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATTGGGAAATGAAGAATGGAA 
GCTTTTTGGAAAAATAAAATAA 



f399 .aa 



t899 . aa 



f 399 .nt 



t899 .nt 
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f924.aa 



MQDRKFSFRKYFLISVFLIFIVSGITYFYSTQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECLNMDDFFIPRPD 
FLNENLNKNLWDGLIKNKFLDENFFKDLWIKKENLFNVDIEKENEKLIDKILEISK 



t924 .aa 



TQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECLNMDDFFIPRPDFLNENLNKNLVVDGLIKNKFLDENFFKDLW 
I KKENL FNVD I EKENEKL I DK I LE I SK 



f924.nt 



ATGCAAGATAGAAAGTTTAGTTTTAGAAAATATTTTTTAATTTCAGTATTTTTGATTTTTATTGTTTCTGGTATTA 
CTTATTTCTATTCAACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATT 
AGTTGATATGGAAGATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGAT 
TTTTTAAATGAAAATTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTT 
TCAAGGATCTTTGGATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAA 
GATTTTAGAAATTTCCAAATGA 



t924 .nt 



ACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATTAGTTGATATGGAAG 
ATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGATTTTTTAAATGAAAA 
TTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTTTCAAGGATCTTTGG 
ATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAAGATTTTAGAAATTT 
CCAAATGA 

f925.aa 

MIRKYLIYISLLFIVFEA/YSKPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGE 
SYDNGAVFTFQTFKKEGKIKLVFTYQNVKDSSEFNKIIILKITKNFEVAIPQGVGGGSSRJDNNIETGNNLELGGGS 
ISGATSKEIIVRALNL5YINDYKGAIDLLNKYNFNDDKYILLKAEIHYKJSIGDYLKSYENYLKLKSKYFQSIVFDLI 
RLAIELNIKEEVLENARYLVEKNVDFSESIYLEIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYE 
S E S KHKDFLKALHY YKLVIDNYP F S Y YYERAK IRYL FLKRF F 

t925 .aa 

KPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGESYDNGAVFTFQTFKKEGKIK 
LVFTYQWKDSSEFNKIIILKITKNFEVAIPQGVGGGSSRDNNIETGNNLELGGGSISGATSKEIIVRALNLSYIN 
DYKGAIDLLNKYNFNDDKYILLKAEIHYKNGDYLKSYENYLKLKSKYFQSIVFDLIRLAIELNIKEEVLENARYLV 
EKNVDFSESIYLEIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYESESKHKDFLKALHYYKLVID 
NYPFSYYYERAKIRYLFLKRFF 

f925.nt 

ATGATTAGAAAATATTTGATTTATATAAGTTTGCTATTTATTGTTTTTGAAGTTTACTCTAAGCCAGCTTTTATAA 
GTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATACCAATTCAAAATTTAA 
TCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATGAAGCTTTTATTAAGTTAATTGGAGAA 
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TCTTATGATAACGGTGCTC-TTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAATTGGTTTTCACTTATC 
AAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATTTTGAAGTTGCAATTCC 
ACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGAACTTGGGGGGGGGAGT 
ATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAATGATTACAAAGGAGCAA 
TAGATTTGCTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAATTCATTATAAAAATGG 
TGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCATTGTTTTTGATCTAATT 
AGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTTGAAAAGAATGTTGATT 
TTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGC-GGAGAGCATGAGTTTGCTTTAAATTTTAG 
CTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTTGGGAAAACTTTATGAG 
TCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGATAATTACCCTTTTAGTT 
ATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 

t925.nt 

AAGCCAGCTTTTATAAGTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATA 
CCAATTCAAAATTTAATCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATGAAGCTTTTAT 
TAAGTTAATTGGAGAATCTTATGATAACGGTGCTGTTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAA 
TTGGTTTTCACTTATCAAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATT 
TTGAAGTTGCAATTCCACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGA 
ACTTGGGGGGGGGAGTATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAAT 
GATTACAAAGGAGCAATAGATTTGCTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAA 
TTCATTATAAAAATGGTGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCAT 
TGTTTTTGATCTAATTAGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTT 
GAAAAGAATGTTGATTTTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGGGGAGAGCATGAGT 
TTGCTTTAAATTTTAGCTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTT 
GGGAAAACTTTATGAGTCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGAT 
AATTACCCTTTTAGTTATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 



f929.aa 



MTKVLWSAIALLSKDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEVNKMEDFFGD 
IIDLKGYKILSVQQENLNLDVYFEQWLAQNFSNLNAYLF1IGFDPKIKAGTILFKTQIDIDPKNSYNMYLEDITG 
DYDFNIVIQGFLKDKSVLYVFQKSVLNDVSSYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKYEKVGE 
DLIISKIEKYEYSIWQGRYCLSSVSEKVGKIDNNIYKTLKNLSKDEVYKFLHGVWYDVHDYNKMHVKDIDEVLFLS 
FERQSSEINLFRKWSQEVAKIEYISKPAYNTLNVSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTNSYDNSGFSG 
TFKRFDEWLNVKKGSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYGIFNIFPLKNNFVLEYEIDMGSYKL 
VESFFLEHSERIVQKQKFSTIILNPIKILKDDVSLVKGQKLKLERIEKI 



t929 .aa 



KDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEVNKMEDFFGDIIDLKGYKILSVQQ 
ENLNLDVYFEQWLAQNFSNLNAYLFIIGFDPKIKAGTILFKTQIDIDPKNSYNMYLEDITGDYDFNIVIQGFLKD 
KSVLYVFQKSVLNDVSSYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKYEKVGEDLIISKIEKYEYSN 
VQGRYCLSSVSEKVGKIDmiYKTLKNLSKDEVYKFLHGVWYDVHDYNKMHVKDIDEVLFLSFERQSSEINLFRKN 
SQEVAKIEYISKPAYNTLWSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTNSYDNSGFSGTFKRFDENVLNVKK 
GSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYGIFNIFPLKNNFVLEYEIDMGSYKLVESFFLEHSERIVQ 
KQKFST I I LNP IKI LKDDVS L VKGQKLKLERIEKI 



f929 .nt 
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ATGACAAAGGTTTTGGTTGTTAGTGCGATTGCTCTTCTGAGTAAGGATAAAGAATTAATCCCATTTTATAAATTTT 
TGTTTTTATTCTTTTTTTTTACATTACTTGCTTGTTCCAAGGTAAGCAAAGATTTTATTGTTTTTAACAAAGATGT 
AAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTTAGAAGTTAATAAAATGGAAGATTTTTTTGGAGAT 
ATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAGGAAAATTTAAATTTAGATGTGTATTTTGAGCAGG 
TGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGTTTATTATTGGTTTTGATCCTAAAATTAAAGCTGG 
AACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAATTCTTATAACATGTATCTTGAAGATATTACAGGT 
GATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGATAAATCTGTTTTGTATGTTTTTCAAAAATCTGTTT 
TAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAGTTAATGGAACTGTTCTTATTAATAAGTATGCAAG 
ATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCCTATTTCTTTAGAAAAATATGAAAAAGTGGGGGAA 
GATTTAATAATTAGCAAGATTGAAAAATATGAATATTCTAATGTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCG 
AAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAAAGAATTTAAGCAAAGATGAAGTTTATAAATTTTT 
GCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCATGTCAAAGATATTGATGAAGTTTTATTCTTGTCT 
TTTGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAATTCTCAAGAAGTTGCAAAGATTGAATATATTTCAA 
AACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTTTTTTCAGATTTGATAGTTTATAACTTTTGGATCAA 
AATTGTAGATAAAGAAAACATTGAAATCAAAATTGACACTAGCACAAATTCTTATGATAATAGTGGATTTTCGGGT 
ACATTTAAGAGGTTTGATGAGAATGTCTTAAATGTTAAAAAAGGGAGTAGTGATATTTATTTTATTCCTAGTGGAA 
ATTACGTC-TATAAGGATAAAATTTATGATTTTTCTTACCCCCATTTAACTTATATTGATGAGAATAAAATTTATTA 
TGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCTTGAATATGAGATTGACATGGGTAGTTACAAGCTT 
GTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAAAAGCAAAAATTTTCTACAATCATTTTAAATCCTA 
TTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAAAATTAAAGCTTGAGCGAATAGAAAAAATATGA 



AAGGATAAAGAATTAATCCCATTTTATAAATTTTTGTTTTTATTCTTTTTTTTTA.CATTACTTGCTTGTTCCAAGG 
TAAGCAAAGATTTTATTGTTTTTAACAAAGATGTAAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTT 
AGAAGTTAATAAAATGGAAGATTTTTTTGGAGATATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAG 
GAAAATTTAAATTTAGATGTGTATTTTGAGCAGGTGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGT 
TTATTATTGGTTTTGATCCTAAAATTAAAGCTGGAACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAA 
TTCTTATAACATGTATCTTGAAGATATTACAGGTGATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGAT 
AAATCTGTTTTGTATGTTTTTCAAAAATCTGTTTTAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAG 
TTAATGGAACTGTTCTTATTAATAAGTATGCAAGATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCC 
tATTTCTTTAGAAAAATATGAAAAAGTGGGGGAAGATTTAATAATTAGCAAGATTGAAAAATATGAATATTCTAAT 
GTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCGAAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAA 
AGAATTTAAGCAAAGATGAAGTTTATAAATTTTTGCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCA 
•-TGTCAAAGATATTGATGAAGTTTTATTCTTGTCTTTTGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAAT 
TCTCAAGAAGTTGCAAAGATTGAATATATTTCAAAACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTTT 
TTTCAGATTTGATAGTTTATAACTTTTGGATCAAAATTGTAGATAAAGAAAACATTGAAATCAAAATTGACACTAG 
C ACAAATTC TTATGATAATAGTGGATTTTC GGGTAC ATTTAAGAGGTTTGATGAGAATGTC TTAAATGTTAAAAAA 
GGGAGTAGTGATATTTATTTT ATTCCTAGTGGAAATTAC GTGTATAAGGATAAAATTT ATGATTTTTC TTAC CC C C 
ATTTAACTTATATTGATGAGAATAAAATTTATTATGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCT 
TGAATATGAGATTGACATGGGTAGTTACAAGCTTGTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAA 
AAGCAAAAATTTTCTACAATCATTTTAAATCCTATTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAA 
AAT.TAAAGCTTGAGCGAATAGAAAAAATATGA 



MNKLLIFVLATFCVFSSFAQANDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLIL 

GFDAVGIGLILAGAYLDIKALIX3ITKKAAFQWTWGKGVMLAGWTMAVTRLTEIILPFTFANSYNRKLKNS 

GGFEPSFDVAMGQSSALGFELSFKKSY 



t929.nt 



f933 .aa 



t933 . 



aa 





73 

TABLE 1 . Nucleotide and Amino Acid Sequences 



NDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLILGFDAVGIGLILAGAYLDIKAL 
IXSITKKAAFQWTWGKGVMLAGVVTMAVTRLTEIILPFTFANSYNRKLiCNSLIWALGGFEPSFDVAMGQSSALGFEL 
SFKKSY 



ATGAATAAACTTTTAATTTTTGTTTTGGCAACCTTTTGTGTTTTTTCTAGCTTTGCTCAAGCTAATGATTCTAAAA 
ATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATCCTATTGTACCATT 
TTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGGTTCTCTTATTCTT 
GGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTTGATGGTATTACTA 
AAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGGCTGTGACAAGATT 
AACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCTTAATGTAGCTTTA 
GGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTGTCTTTCAAAAAAA 
GCTATTAA 



AATGATTCTAAAAATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATC 
CTATTGTACCATTTTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGG 
TTCTCTTATTCTTGGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTT 
GATGGTATTACTAAAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGG 
CTGTGACAAGATTAACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCT 
TAATGTAGCTTTAGGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTG 
TCTTTCAAAAAAAGCTATTAA 



MRKYIFIILIAVLLIGVMIKKIAAAANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQYHILNN 
LAIGLELRYMFNFDINHSFNILNPDSSVGKIFYSVPITFSINYIFDIGELFQIPVFTNIGFSLNTYGDRNNNITNL 
RTFD ALPT I S FGS G I LWNFNYKWAFGATAS WWMMF EFGNS AKMAHFALVS LS VT VNVNKL 



ANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYOYHILMNLAIGLELRYMFNFDINHSFNILNPD 
SSVGKIFYSVPITFSIlSrYIFDIGELFQIPVFTNIGFSLNTYGDRNNNITNLRTFDALPTISFGSGILWNFNYKWAF 
GATASWWMMFEFGNSAKMAHFALVSLSVTVNVNKL 



ATGAGAAAGTATATTTTTATAATACTAATTGCAGTCTTGCTAATTGGTGTAAACATAAAAAAAATTGCGGCCGCAG 
CCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCCCTATTTTTTACAACGACTT 
ATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAATAAAATACCAGTACCATATTTTAAACAAT 
TTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATTAAATCCAGATT 
CAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATAGGAGAATTATT 



£933 .nt 



t933 .nt 



f940 .aa 



t940 .aa 



f940.nt 
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TABLE 1. Nucleotide and Amino Acid Sequences 

TCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTTA 
AGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTTG 
GAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCATT 
ATCAGTTACAGTGAATGTAAATAAATTGTAG 



GCCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCCCTATTTTTTACAACGACT 
TATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAATAAAATACCAGTACCATATTTTAAACAA 
TTTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATTAAATCCAGAT 
TCAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATAGGAGAATTAT 
TTCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTT 
AAGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTT 
GGAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCAT 
TATCAGTTACAGTGAATGTAAATAAATTGTAG 



MKNQFLNSYFQLITTIFLISSITIAAEEITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVT 
KNRKIYTIAKTLQKPIGIDYTONKLYISSVDKIYWK^ 

KVDSKNNKLIVNIGSQHNVKIPPKKEAVILSINLKTKKEEIVAFGVRNSVGFDFHPISNEIYFSDNGQDGLGDNIP 
PDEIWITEYKEHFGFPYVFGKNQKNYGFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIAEH 
GSWNRSSPVGYKITTLDIDSKTRTARNYKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



EITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVTKNRKIYTIAKTLQKPIGIDYWDNKLYI 
SSVDKIYVVKIWKEEJNKSIKSHKDYTWKMQIFALLPK^ 

VILSINLKTKKEEIVAFGVRNSVGFDFHPISNEIYFSDNGQDGLGDNIPPDEINVITEYKEHFGFPYVFGKNQKNY 
GFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIAEHGSWNRSSPVGYKITTLDIDSKTRTARN 
YKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



ATGAAAAATCAATTTTTAAATAGCTATTTTCAATTAATTACAACTATTTTCTTAATCTCATCTATAACTATTGCAG 
CAGAAGAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAA 
ACCTAGAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACA 
AAAAACAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCT 
ACATATCTTCTGTCGATAAAATATATGTAGTTAAAAATGTAAAAGAAGAAATTAATAAAAGCATAAAATCACATAA 
AGACTATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATT 
AAAGTAGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAG 
AAGCAGTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGTGAGAAACTCAGTTGG 
GTTTGATTTTCACCCAATTAGCAATGAAATATATTTTAGCGACAATGGCCAAGACGGATTAGGAGACAACATTCCC 
CCAGATGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAAA 
ATTACGGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCCGCACATGTAGC 
TCCACTTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTATTCATAGCAGAACAC 
GGCTCGTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCAA 



t940 .nt 



£ 943 . 



t943 .aa 



f943 .nt 
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GAAATTACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATCTAAATTTGGACGCCCTGTTGATATAATCAC 
ATATTATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



GAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAAACCTA 
GAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACAAAAAA 
CAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCTACATA 
TCTTCTGTCGATAAAATATATGTAGTTAAAAATGTAAAAGAAGAAATTAATAAAAGCATAAAATCACATAAAGACT 
ATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATTAAAGT 
AGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAGAAGCA 
GTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGTGAGAAACTCAGTTGGGTTTG 
ATTTTCACCCAATTAGCAATGAAATATATTTTAGCGACAATGGCCAAGACGGATTAGGAGACAACATTCCCCCAGA 
TGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAAAATTAC 
GGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCCGCACATGTAGCTCCAC 
TTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTATTCATAGCAGAACACGGCTC 
GTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCAAGAAAT 
TACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATCTAAATTTGGACGCCCTGTTGATATAATCACATATT 
ATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



MNYARFAVLIVLLFFYIWFFIILRMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGD 
FESPIIVYGKSFNKSYEAKKVLKSMGFKNVFVAGTLKDMPQAKKEVG 



RMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGDFESPIIVYGKSFNKSYEAKKVLK 
SMGFKNVFVAGTLKDMPQAKKEVG ~ 



ATGAATTATGCAAGATTTGCAGTATTAATAGTTCTGCTTTTTTTTTATATTTGGTTTTTTATTATCCTTAGGATGA 
AAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGATATTCGGTCTCCCAAAGAATA 
TAGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGATAAATTAGGTGAT 
TTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAAAAGCATGG 
GATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCAGAAGCAAAAAAAGAAGTTGGTTGA 



AGGATGAAAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGATATTCGGTCTCCCA 
AAGAATATAGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGATAAATT 
AGGTGATTTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAAA 
AGCATGGGATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCACAAGCAAAAAAAGAAGTTGGTTGA 



t943 .nt 



f952 .aa 



t952 .aa 



f952 .nt 



t952.nt 
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TABLE 1. Nucleotide and Amino Acid Sequences 
f378.aa 



MIKKFLLFAMLNIFLTNKAHSNEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYI 
DESLIEGVWYDIAYAQMLLETGALKF3MGIVSKSQHNFSGIGATNNLTKGNSFSNITEGIKAHIQHLKAYASKQNIK 
SNMVDPRFYLVKRGSAPTIYDLTGKWAKDKLYDKKLKKILLELLEYNNANKS 



NEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDK3STYWTVAKTYIDESLIEGVNTOIAYAQMLLET 
GALKFNGIVSKEQHNFSGIGATNNLTKGNSFSNITEGIKAHIQHLKAYASKQNIKSNMVDPRFYLVKRGSAPTIYD 
LTGKWAKDKLYDKKLKKILLELLEYNNANKS 



.ATGATAAAAAAATTCTTGCTATTTGCAATGCTCAACATCTTTTTAACAAATAAAGCTCATAGTAATGAAGAGATAA 
TCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAACTCAACTAGAAGA 
CCTTGTAAAATATACTCTAGAAATAAATCGAGAGCTTGACAAAAACTATGTAAATACTGTTGCTAAAACCTATATA 
GACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACAGGAGCTCTAAAAT 
TCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTACAAAAGGAAATTC 
TTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAAACAAAATATCAAA 
TCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGATTTGACTGGGAAAT 
GGGCAAAAGACAAACTTTACGACAAAAAACTTAAAAAAATATTATTAGAACTATTAGAATATAATAATGCAAATAA 
AAGCTAA 



AATGAAGAGATAATCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAA 
CTCAACTAGAAGACCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATGTAAATACTGTTGC 
TAAAACCTATATAGACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACA 
GGAGCTCTAAAATTCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTA 
CAAAAGGAAATTCTTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAA 
ACAAAATATCAAATCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGAT 
TTGACTGGGAAATGGGCAAAAGACAAACTTTACGACAAAAAACTTAAAAAAATATTATTAGAACTATTAGAATATA 
ATAATGCAAATAAAAGCTAA 



MKLFRRNVMIKMPSSFTIIFSLIVFVTILTYVIPAGKFDKEFKQMGDGSKREIIVAGTYQYVDRGSRGFLHPIMTI 
LTAMSKGMEHAVEVIVFVLIVGGAYGIIMKTGAIDVGIYFLIKKLGHKDKLLIPLLMFIFSIGGTVTGMSEETLPF 
YFVMI PL IVALGYDS LVGAAI I ALGAGVGTMASTVNPFATGI ASAI AS I SLQDGFYFRI VL YFVSVL AAITYVCVY 
ASKIKKDPSKSLVYSQKDEHYQYFVKKDGLSTGDNAQNALEFTFAHKLVLLLFGFMILILIFSIWLGWWMQEMTM 
LYLGVAIISAFICKLGETEMWDAFVKGSESLLTAALVIGLARGVMIVCDDGLITDTMLNAATNFLYNLPRPLFIIL 
NEIIQIFIGFWPSSSGHASLTMPIMAPLADFLSIPRASWIAMQTASGLINLITPTSGVIMAVLGISRLSYGTWF 
KFVLPLFMIEFFISILVIIANIYLSF 



t378 . 



f378.nt 



t378 .nt 



f4.aa 



t4 .aa 




77 

TABLE 1. Nucleotide and Amino Acid Sequences 

KFDKEFKQMGDGSKREIIVAGTYQYVDRGSRGFLHPIMTILTAMSKGMEHAVEVIVFVLIVGGAYGIIMKTGAIDV 
GIYFLIKKLGHKDKLLIPLLMFIFSIGGTVTGMSEETLPFYFVMIPLIVALGYDSLVGAAIIALGAGVGTMASTVN 
PFATGIASAIASISLQDGFYFRIVLYFVSVLAAITYVCVYASKIKKDPSKSLVYSQKDEHYQYFVKKDGLSTGDNA 
QNALEFTFAHKLVLLLFGFMILILIFSIVNLGWWMQEMTMLYLGVAIISAFICKLGETEMWDAFVKGSESLLTAAL 
VIGLARGVMIVCDDGLITDTMLNAATNFLYNLPRPLFIILNEIIQIFIGFWPSSSGHASLTMPIMAPLADFLSIP 
RASWIAMQTASGLINLITPTSGVIMAVLGISRLSYGTWFKFVLPLFMIEFFISILVIIANIYLSF 



f 4 .nt 



ATGAAATTATTTAGGAGAAACGTTATGATCAAAATGCCAAGTAGTTTTACAATAATATTTTCTTTAATTGTATTTG 
TTACCATTTTAACGTATGTGATTCCTGCCGGTAAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAG 
GGAAAT AATTGTTGC TGGAACTTATC AATATGTAGATCGAGGC TC T AGGGGATTTTT AC AT CCTATTATGAC TAT T 
TTAACCGCAATGTCAAAGGGGATGGAACATGCAGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATG 
GGATTATTATGAAAACTGGAGCAATAGATGTGGGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGATAAGTT 
GCTTATTCCTTTGTTAATGTTTATTTTTTCAATTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTT 
TATTTTGTTATGATTCCCTTGATAGTAGCTTTGGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAGGAG 
CTGGAGTGGGAACTATGGCTTCTACTGTAAATCCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTT 
GCAGGATGGATTTTATTTTAGAATTGTTCTTTATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTAT 
GCGTCTAAAATTAAAAAGGATCCCTCAAAATCGCTTGTGTATTCTCAAAAAGATGAACATTATCAATATTTTGTTA 
AAAAAGATGGACTTTCTACCGGAGATAATGCTCAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACT 
TTTATTTGGATTTATGATATTGATTTTGATATTTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATG 
TTGTATCTTGGAC-TTGCTATTATATCGGCTTTTATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGTTTGTGA 
AAGGTTCTGAAAGTCTGCTAACCGCTGCTCTTGTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGG 
GT'TGATT ACAGATAC TATGTT AAATGCTGC TAC T AATTTTTTAT AC AATC TTC C AAGACCC CTTTTTATC AT ATTG 
AATGAAATTATTCAAATATTTATAGGATTTGTTGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAA 
TGGCTCCTCTTGCCGATTTTTTGTCAATTCCAAGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTAT 
TAATTTGATAACACCTACCAGCGGAGTTATAATGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTT 
AAGTTTGTTTTACCATTATTTATGATTGAGTTTTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTT 
TTTAG 



t4 .nt 



AAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAGGGAAATAATTGTTGCTGGAACTTATCAATATG 
'TAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATTTTAACCGCAATGTCAAAGGGGATGGAACATGC 
AGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATGGGATTATTATGAAAACTGGAGCAATAGATGTG 
GGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGATAAGTTGCTTATTCCTTTGTTAATGTTTATTTTTTCAA 
TTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTTTATTTTGTTATGATTCCCTTGATAGTAGCTTT 
GGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAGGAGCTGGAGTGGGAACTATGGCTTCTACTGTAAAT 
CCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTTGCAGGATGGATTTTATTTTAGAATTGTTCTTT 
ATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTATGCGTCTAAAATTAAAAAGGATCCCTCAAAATC 
GCTTGTGTATTCTCAAAAAGATGAACATTATCAATATTTTGTTAAAAAAGATGGACTTTCTACCGGAGATAATGCT 
CAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACTTTTATTTGGATTTATGATATTGATTTTGATAT 
TTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATGTTGTATCTTGGAGTTGCTATTATATCGGCTTT 
TATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGTTTGTGAAAGGTTCTGAAAGTCTGCTAACCGCTGCTCTT 
GTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGGGTTGATTACAGATACTATGTTAAATGCTGCTA 
CTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCATATTGAATGAAATTATTCAAATATTTATAGGATTTGT 
TGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAATGGCTCCTCTTGCCGATTTTTTGTCAATTCCA 
AGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTATTAATTTGATAACACCTACCAGCGGAGTTATAA 
TGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTTAAGTTTGTTTTACCATTATTTATGATTGAGTT 
TTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTTTTTAG 




TABLE 1 . Nucleotide and Amino Acid Sequences 



f 43 .aa 



MKYFYFLFFLLIF1WYAQNVWSPALPSPPLLPEITENKPVERENSSKGENFSNVGLDGKYVNDTILYGLDSQVTSI 
IKALKKSSDSQYNFSLKKRLEKTFNAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFD 
DKEKLKKTLIDILENKEGNWSIAAYYLGELNSLEYSKNmEVFEKYSGNDGARREILIALGKMSAVDYQDRIYEI 
SLDNYEGPSIKAAAIEALSYLASDKVTENADLYLQSNN3SINLNVKLAIIAS 

KAINAIKGKRDSSAKDILIYKLKSDPSLKVREASAKALIDMDLGNIEIKNIMFDFKIDMNFKISMFSYLLDKDSLK 
ALSIALEIVNKENINRPSWLRGVASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSE 
TNSEYIKMLLKDY 



t43 .aa 



LPSPPLLPEITENKPVERENSSKGENFSNVGLDGKYVNDTILYGLDSQVTSIIKALKKSSDSQYMFSLKKRLEKTF 
NAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFDDKEKLKKTLIDILENKEGNWSIA 
AYYLGELNSLEYSKNMMEVFEKYSGNDGARREILIALGKMSAVDYQDRIYEISLDNYEGPSIKAAAIEALSYLASD 
KVTENADLYLQSWNNNLlSJVKLAIIASLSKDPSLKSKEILQGFLRDSDDNrRFKAINAIKGHRDSSAKDILIYKLKS 
DPSLKVREASAKALIDMDLGNIEIKNIMFDFKIDN1MFKISMFSYLLDKDSLKALSIALEIVNKENINRPSNVLRGV 
ASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSETNSEYIKMLLKDY 



f43 .nt 



ATGAAATACTTTTATTTTTTATTTTTTTTACTTATTTTTAATGTGTATGCTCAAAATGTTAATTCTCCAGCTCTTC 
CTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGAATTT 
TTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGGGCTTGATAGTCAAGTGACAAGCATT 
ATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTTAATG 
CTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTGATACAGCAAATTATAT 
TCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATCTTAAGGAGTTTGAT 
GATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCAGCTT 
ATTATTTAGGAGAGCTTAATTCTCfTGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAAATGA 
TGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGAAATT 
TCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGATAAAG 
TAAGTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTTCTTT 
GTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTCTGATGATAATATTAGGTTT 
AAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGCGATC 
CATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAAACAT 
TATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCTAAAA 
GCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGCGTTGCTT 
CAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATTTAAG 
GCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAGTGAA 
ACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



t43 .nt 



CTTCCTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGA 
ATTTTTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGGGCTTGATAGTCAAGTGACAAG 
CATTATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTT 
AATGCTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTGATACAGCAAATT 
ATATTCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATCTTAAGGAGTT 



# 
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TABLE 1. Nucleotide and Amino Acid Sequences 

TGATGATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCA 
GCTTATTATTTAGGAGAGCTTAATTCTCTTGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAA 
ATGATGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGA 
AATTTCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGAT 
AAAGTAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTT 
CTTTGTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTCTGATGATAATATTAG 
GTTTAAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGC 
GATCCATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAA 
ACATTATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCT 
AAAAGCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGCGTT 
GCTTCAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATT 
TAAGGCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAG 
TGAAACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



MKFVLNNL.FKGCLICFFLFFSCLTTDRSIQDSHISDIVEXKKEAVIIDDNNWLGSNEGKFKRDYLIGLKDNESFF 
LSDAFLKENNFYFKKARESYAKKNIGLTNYYLNKIVTNENQHSRELLAKANLFFGYVNYENGFYDLSEYWFDLFLK 
DYKYSHASLRLAELKYLVKEKSDAISAFKEINEFSISGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFND 
NIFVTNILGGLLRYNIKKNDCRWLKDKKSIFLNGIRGFADYNGTIYIGGKNVVYYIDDVDGDLKQINVPGKADFS 
WQVLLAVKNGIFVGTLNSGLWFYDLKN^^ 

FSKNDNEKNINFIKEYKDSYFVGTYGGGLFELNLNKNSYKKWIANNIDVNYFMDMEIKDKKLLFATFDHGLLIYD 
S ENDNWDYFGPIWGLLNLNL I KVS RF ENYV I LGT INWGL VF VDENI KKQ L 



CLTTDRSIQDSHISDIVEKKKEAVIIDDNNWLGSNEGKFKRDYLIGLKDNESFFLSDAFLKENNFYFKKARESYA 

KKNIGLTNYYLNKIVTNENQHSRELLAKANLFFGYVNYENGFYDLSEYNFDLFLKDYKYSHASLRLAELKYLVKEK 

SDAISAFKEINEFSISGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFNDNIFVTNILGGLLRYNIKKNDC 

RVYLKDKKSIFLNGIRGFADYNGTIYIGGKIWVYYIDDVDGDLKQIWPGNADFSNVQVLLAVKJSIGIFVGTLNSGL 

WFYDLKNWKNIPLGSNKISSLCFDSLKKLLLVGTVDKAIYSVNVDNLKKIEHLDFF 

VGTYGGGLFELNLNKNSYKKHVIANNIDVNYFMm 

KVSRF ENYVI LGT INNGLVFVDENI KKQL 



ATGAAATTTGTTTTGAATAATTTATTTAAAGGTTGTCTTATATGTTTTTTCTTGTTTTTTTCCTGCCTTACTACAG 
ATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGAAGCAGTCATTATTGATGATAATAA 
TGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAGAGACTATTTGATAGGATTAAAAGATAATGAATCTTTTTTT 
CTTAGTGATGCTTTTTTAAAAGAAAATAATTTTTATTTTAAAAAAGCCAGGGAAAGTTATGCTAAAAAAAATATTG 
GCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATCAGCACAGCAGAGAATTGCTAGCTAAAGCGAA 
TTTGTTTTTTGGATATGTAAATTATGAGAATGGTTTTTATGATCTTTCCGAATATAATTTTGATCTATTTTTAAAA 
GACTATAAATATTCTCATGCTAGTTTAAGATTAGCTGAATTAAAATATCTTGTTAAAGAAAAATCTGATGCAATTT 
CTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGGTTATGATAGAGAGATTTATGGCTTTTTAAGTAATAAACT 
TGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAACAGCGTTTTTGATACATTTGTCTTTAATGAC 
.AATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAATGATTGTAGAGTCTATCTTA 
AGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGGTTTTGCGGATTATAATGGAACAATTTATATTGGTGGTAA 
AAATGTTGTTTATTATATAGATGATGTTGATGGGGATTTAAAGCAAATAAATGTTCCCGGTAATGCTGATTTTAGC 
AATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTGTTGGCACTCTAAATTCTGGATTATGGTTTTATGATT 
TAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTGCTTTGATAGTTTAAAAAATTTATT 



f50 -aa 



t50 .aa 



f50 .nt 
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TABLE 1 . Nucleotide and Amino Acid Sequences 



ATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGAACATTTGGATTTT 
TTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTTGTTGGAACATATG 
GTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATAATATTGATGTTAA 
TTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTTATTGATTTATGAT 
TCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATAAAAGTTTCTAGAT 
TTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTGTAGATGAAAATATTAAAAAACAGTTATG 
A 



t50 -nt 



TGCCTTACTACAGATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGAAGCAGTCATTA 
TTGATGATAATAATGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAGAGACTATTTGATAGGATTAAAAGATAA 
TGAATCTTTTTTTCTTAGTGATGCTTTTTTAAAAGAAAATAATTTTTATTTTAAAAAAGCCAGGGAAAGTTATGCT 
AAAAAAAATATTGGCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATCAGCACAGCAGAGAATTGC 
TAGCTAAAGCGAATTTGTTTTTTGGATATGTAAATTATGAGAATGGTTTTTATGATCTTTCCGAATATAATTTTGA 
TCTATTTTTAAAAGACTATAAATATTCTCATGCTAGTTTAAGATTAGCTGAATTAAAATATCTTGTTAAAGAAAAA 
TCTGATGCAATTTCTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGGTTATGATAGAGAGATTTATGGCTTTT 
TAAGTAATAAACTTGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAACAGCGTTTTTGATACATT 
TGTCTTTAATGACAATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAATGATTGT 
AGAGTCTATCTTAAGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGGTTTTGCGGATTATAATGGAACAATTT 
ATATTGGTGGTAAAAATGTTGTTTATTATATAGATGATGTTGATGGGGATTTAAAGCAAATAAATGTTCCCGGTAA 
TGCTGATTTTAGCAATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTGTTGGCACTCTAAATTCTGGATTA 
TGGTTTTATGATTTAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTGCTTTGATAGTT 
TAAAAAATTTATTATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGA 
ACATTTGGATTTTTTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTT 
GTTGGAACATATGGTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATA 
ATATTGATGTTAATTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTT 
ATTGATTTATGATTCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATA 
AAAGTTTCTAGATTTGAAAATTATGTCATACTGGGCACTAT'TAATAACGGTTTGGTTTTTGTAGATGAAAATATTA 
AAAAACAGTTATGA 



f65.aa 

miFKNVPFQINLILFLLVSVAKINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKI 
ETKEQWEKYKLLFKMHVNLLLVRQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSKKLYKLAINYYSEALKYHKKL 
EISTiTTTVKLENDGITNWEDEYHKISLKELNYYDIIKKELLRIDETKAFFEQGPNYY 

t65 . aa 

KINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKIETKEQWEKYKLLFKMHVNLLLV 
RQNLHLGDLFDTRWLYFFKTPEKDGIISNLEKSKKLYKLAINYYSEALKYHKKLENYTTVKLENDGITNWEDEYHK 
ISLKELNYYDIIKKELLRIDETKAFFEQGPNYY 

f65 .nt 

ATGCATATTTTCAAAAATGTCCCCTTCCAAATAAATTTAATTTTATTTCTTTTAGTATCAGTTGCAAAGATAAATG 
CATCGTCCAAATTTTATTACGCAGAACAATGGTATGTAATTTTTAATTCTCAAATGAAAAAAAAACCTGAAAACTA 
TAAAAAAAATATATTTTTTCTTCAAAAAGCCTTAAAATACCCATTTGGAAATCCAAAATATTCTCTAACTAAAATA 
GAAAC C AAAGAAC AGTGGGAAAAAT ATAAAC TTC TTTTC AAAATGC ATGT AAAC TTGCTTCT AGTT AGGC AAAATT 
TACATTTAGGAGATTTATTCGACACAAGAAATTTATATTTTTTCAAAACTCCAGAAAAAGATGGAATTATTTCCAA 
TCTAGAAAAATCAAAAAAATTATATAAACTAGCTATTAATTACTACAGCGAAGCACTAAAATACCACAAAAAACTT 
GAAAATTACACAACTGTTAAACTAGAAAACGATGGAATAACAAACTGGGAAGATGAATATCATAAAATTTCTCTTA 



81 

TABLE 1 . Nucleotide and Amino Acid Sequences 



AAGAGCTTAATTACTATGACATTATTAAAAAAGAACTACTAAGAATTGACGAAACTAAAGCATTTTTTGAACAAGG 
GCCAAAC TATTATTAA 

t65 -nt 

KINASSKFYYAEQWY^/IFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKIETKEQWEKYKLLFKMHWLLLV 
RQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSKKLYKLAINYYSEALKYHKKLENYTTVKLENDGITNWEDEYHK 
I SLKELNYYDI IKKELLRIDETKAFFEQGPNYY 



£8 .aa 



MKNINRLILLILTTHTLLFSCALIADNKSKNLSTSEIILTQKTLLESSLIKNPSWEYRIPISSIQEILMSJNNDSF 
LIKKTAAKIKISPQKLEEIKNYLNAYKNYLOTSfETEWIKFIDQSSWGNLTIKIDTAFEKKTNFNHTNSDNENLTEL 
IELQMHLEKEILNLIEQTFHDKNLGYIQLSHINSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQF 
MHFLKVENSKIKTIIEKQKISDLHNELYYSKQSPPRRRKRSTADSDNNNKYDI IPKIIDPNTGIEITPKNLRSILS 
NGDIILIKPKIDWTEFFYFWQHVGIFDEEKYEATKKIAFNGIDSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQ 
SRILKIFSPITDIRTIQKAINFGRSRYIDNNFGYMVPLISSNLWTDSFNLEEIHNKTYCSLMVDRIYKIAGLNVSR 
NYEISGIITPGEINAAAYNFYMSYTIAGILPSVLPKRLIKPTLKEKFIGYWKEIVDAIELKKSKEKIFGRACNITN 
LWCSGS 



t8 .aa 



CALIADl^SKNLST^IILTQKTLLESSLIKNPSNVEYRIPISSIQEILNOTJNDSFLIKKTAAKIKISPQKLEEIK 
NYLNAYKNYLNNETEWIKFIDQSSVNGNLTIKIDTAFEKKTNFNKTNSDNENLTELIELQMHLEKEILNLIEQTFH 
DKNLGYIQLSHINSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQFMHFLKVENSKIKTIIEKQKI 
SDLHNELYYSXQSPPRRRKRSTADSDNNNKYDIIPKIIDPNTGIEITPKNLRSILSNGDIILIKPKIDWTEFFYFW 
QHVGIFDEEKYEATKKIAFNGIDSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQSRILKIFSPITDIRTIQKAI 
NFGRSRYIDNNFGYMVPLISSNLWTDSFNLEEIHNKTYCSLMVDRIYKIAGLNVSRNYEISGIITPGEINAAAYNF 
YMSYTIAGILPSVLPKRLIKPTLKEKFIGYNKEIVDAIELKKSKEKIFGRACNITNLWCSGS 



f 8 .nt 



ATGAAGAATATTAATAGATTAATATTATTAATATTAACTACACACACTTTATTATTCTCTTGTGCCTTAATTGCAG 
ATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAGAAAGCTCTTTAATAAA 
AAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAATAACAATGATTCTTTT 
TTAATAAAAAAAACAGCAGCAAAAATCAAAATAAGCCCTCAAAAACTTGAAGAAATAAAAAACTATCTAAATGCTT 
ATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCAATGGAAATTTAACAAT 
TAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTTAATCATACAAATTCAGATAATGAAAATTTAACAGAACTA 
ATAGAACTACAAATGCATCTGGAAAAAGAAATTTTAAACTTAATTGAGCAAACATTTCATGATAAAAATTTAGGAT 
ATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAAAAGAAATAATAGATGG 
AAJVAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAAAAATATTTTGAACAATTT 
ATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAAAAAATTTCAGATCTTCACAATG 
AACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCGATAATAACAATAAATA 
CGATATAATACCAAAAATAATAGACCCAAATACAGGCATTC-AAATAACTCCTAAAAATTTAAGATCTATTTTATCA 
AATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGGCAACATGTGGGAATAT 
TTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAA'TGGAATTGATAGCTTTGATATAAAATCAATAAT 
TACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAAAAGCTTTCAACATACGTACAA 
TCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATTAATTTTGGAAGAAGTA 
GATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAGATTCATTCAATCTTGA 
AGAAATTCACAACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAATAGCAGGACTTAATGTATCAAGA 



82 




TABLE 1. Nucleotide and Amino Acid Sequences 

AATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTTTACATGTCTTATACGA 
TTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTAAACCAACATTAAAAGAAAAATTCATTGGTTACAA 
TAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGCTTGCAACATTACAAAT 
CTCTGGTGCTCAGGAAGTTAA 



TGTGCCTTAATTGCAGATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAG 
AAAGCTCTTTAATAAAAAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAA 
TAACAATGATTCTTTTTTAATAAAAAAAACAGCAGCAAAAATCAAAATAAGCCCTCAAAAACTTGAAGAAATAAAA 
AACTATCTAAATGCTTATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCA 
ATGGAAATTTAACAATTAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTTAATCATACAAATTCAGATAATGA 
AAATTTAACAGAACTAATAGAACTACAAATGCATCTGGAAAAAGAAATTTTAAACTTAATTGAGCAAACATTTCAT 
GATAAAAATTTAGGATATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAA 
AAGAAATAATAGATGGAAAAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAAAA 
ATATTTTGAACAATTTATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAAAAAATT 
TCAGATCTTCACAATGAACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCG 
ATAATAACAATAAATACGATATAATACCAAAAATAATAGACCCAAATACAGGCATTGAAATAACTCCTAAAAATTT 
AAGATCTATTTTATCAAATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGG 
CAACATGTGGGAATATTTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAATGGAATTGATAGCTTTG 
ATATAAAATCAATAATTACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAAAAGCT 
TTCAACATACGTACAATCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATT 
AATTTTGGAAGAAGTAGATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAG 
ATTCATTCAATCTTGAAGAAATTCACAACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAATAGCAGG 
ACTTAATGTATCAAGAAATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTT 
TACATGTCTTATACGATTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTAAACCAACATTAAAAGAAA 
AATTCATTGGTTACAATAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGC 
TTGCAACATTACAAATCTCTGGTGCTCAGGAAGTTAA 



MTRVF SKFFLFFCF SMLLFANSEDSNEKD I VS KDENPVFENEVLGYWVG YNDVSNIKNS 1 1 YI YKYNGE VYGRI LT 
IIKDGKKYDAKNPSGDTWGFENLAIEGLDFMWGLKYSSSSKKWDRGKIIDPKNGKIYNSEMRVDSKTGNLITKGK 
VWI FGRSKI WTRAKDDE I PKLDLHNLVPAP PVKK 



EDSNEKDIVSKDENPVFENEVLGYWGYNDVSNIK^ISIIYIYKYNGEVYGP.ILTIIKDGKKYDAKNPSGDTWGFE 
NLAIEGLDFMWGLKYSSSSKKWDRGKIIDPKNGKIYNSEMRVDSKTGNLITKGKVWIFGRSKIWTRAKDDEIPKLD 
LHNLVPAPPVKK 



ATGACTAGAGTTTTTTCAAAGTTTTTTCTTTTTTTTTGTTTTTCAATGCTTTTATTTGCAAATTCAGAAGATTCAA 
ATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGGTTGGTTATAA 
TGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCGAATTTTAACT 
ATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTGTAGTTGGGTTTGAAAATCTTGCAA 
TAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTGGGATAGGGGCAAAATAATAGA 



t8 .nt 



f82 .aa 



t82 . 



f82nt 




8: 




TABLE 1. Nucleotide and Amino Acid Sequences 

TCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTACCAA.GGGGAAA 
GTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGATTTGCATAATC 
TTGTTCCAGCGCCCCCTGTGAAAAAATAA 



GAAGATTCAAATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGG 
TTGGTTATAATGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCG 
AATTTTAACTATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTGTAGTTGGGTTTGAA 
AATCTTGCAATAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTGGGATAGGGGCA 
AAATAATAGATCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTAC 
CAAGGGGAAAGTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGAT 
TTGCATAATCTTGTTCCAGCGCCCCCTGTGAAAAAATAA 



MNKLMLMLITFATSLLAQTNKASTGLKTDQSFNNSLSESVKLiKEIADI YPTNTNFLTGIGIVAGLAGKGDSIKQKD 

LIIKILEENNIINEIGS]SMIESKKIALVWSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGSIIA 

IASGITQPNNKLKGSGYTIDSVIINENQNIJraSYNIILKKGNYTLimiHKILTSKKINNKIKSDSTIEIEAKNIS 

LLEEIENIKIETNPKILIDKKNGIILASEISIAKIGTFT^ 

NK E L I Q 1 1 Q AAQ K INKLNGE L I L EE I DGNQN 



LKTDQSFCMSLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDLIIKILEENNIINEIGSNNIESKNI 
ALVWSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEIIAIASGITQPNNKLKGSGYTIDSVIIN 
ENQNIMiSYNIILKKGNYTLINRIHKILTSKKINNKIKSDSTIEIEAKNISLLEEIENIKIETNPKILIDKKNGII 
LASEJmKIGTFTFSIEKDNQNIFLSKMSTKTTIQVNSMKLNEFILKNSNNLSNKELIQIIQAAQKINKLNGELILEE 
IDGNQN 



ATGAACAAACTAATGTTGATGTTAATTACATTTGCAACGAGTCTATTAGCCCAAACAAACAAAGCTTCAACAGGAC 
TAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCAC 
AAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGAC 
CTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTCTAATAACATAGAAAGTAAAAATATTG 
CACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAACATAAAGCTTGCGTTGCATCAAT 
ACTGGACTCAAAAGATTTAACAAATGGAATACTTTTAAAAACAAATCTTAAAAATAAAGAGGGGGAAATAATAGCA 
ATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATCTGGATATACTATAGATAGTGTAATAATAAATG 
AGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATACATTAATAAATAGAATTCATAA 
AATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAATAGAAATAGAAGCAAAAAACATAAGC 
CTATTAGAAGAGATTGAAAATATTAAAATAGAAACCAACCCCAAGATATTAATAGACAAAAAAAATGGTATTATTT 
TAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAAAGACAATCAAAACATTTTTTTAAGTAA 
AAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATTTATATTAAAAAATTCCAACAATCTTAGC 
AATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAATTAAATGGGGAACTTATCTTGGAGGAAA 
TTGATGGAAACCAAAATTAA 



f82 .nt 



f86 .aa 



t86 . aa 



f8 6.nt 



t S 6 . nt 



TABLE 1 . Nucleotide and Amino Acid Sequences 



CTAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCA 
CAAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGA 
CCTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTCTAATAACATAGAAAGTAAAAATATT 
GCACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAACATAAAGCTTGCGTTGCATCAA 
TACTGGACTCAAAAGATTTAACAAATGGAATACTTTTAAAAACAAATCTTAAAAATAAAGAGGGGGAAATAATAGC 
AATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATCTGGATATACTATAGATAGTGTAATAATAAAT 
GAGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATACATTAATAAATAGAATTCATA 
AAATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAATAGAAATAGAAGCAAAAAACATAAG 
C C T ATT AG AAGAGAT TG AAAAT ATT AAAAT AGAAAC C AAC C C C AAG AT ATT AAT AG AC AAAAAAAAT GGT ATT ATT 
TTAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAAAGACAATCAAAACATTTTTTTAAGTA 
AAAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATTTATATTAAAAAATTCCAACAATCTTAG 
CAATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAATTAAATGGGGAACTTATCTTGGAGGAA 
ATTGATGGAAACCAAAATTAA 



f90 .aa 



MCPITFTIPFFLAIFFAFSSSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFI 
■KSLFEVIXLPIWLFIIFASGYFLNAFSIFLCISSFLSFMFI 



t90 .aa 



SSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFIKSLFEVIKLPIWLFIIFAS 
GYFLNAFSJFLCISSFLSFMFI 

f90 .nt 



ATGTGTCCTATTACTTTTACCATTCCATTTTTTCTAGCAATATTTTTTGCTTTTTCAAGCTCCTTTGTTACGGACT 
CTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTACTTTAACTCCAATTTCTTTGCCTATTATTTCTGG 
TACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTTTTCTAAAATGATTTTCATC 
AAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCAGGATACTTTTTAAATGCTT 
TTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



t90 .nt 



AGCTCCTTTGTTACGGACTCTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTACTTTAACTCCAATTT 
CTTTGCCTATTATTTCTGGTACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTT 
TTCTAAAATGATTTTCATCAAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCA 
GGATACTTTTTAAATGCTTTTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



f469 .aa 

MANVALSSGFISQKIFGIIIIMVFLPTIIATPIINFLFKINKSGLKKSLPIDQNTHICVSFEYDNLAKILIWDFKN 
ELRKEGFFTQQIKNDSSQYINARKNNISFSIKREGSKITFEC PNNHLIIIQDLFRETILNLEKITKEVETVSLRAK 
KLDYSIOTDKILSNINLNKRIKKENIILELKSSNKADVIRELLSVINIEIDKZRIFQDLMEREKLITTALKEGFAI 
PHLKTNLISKIHIAIGISHEGIDFNALDKNLSHVFILILCPAKDYVSYPRILASWGKVDLYKKEILNAKTDKEIY 
NIIVSZ 



t469 .aa 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

VFLPTIIATPIINFLFKINKSGLKKELPIDQNTHICVSFEYDNLAKILIWDFKNELRKEGFFTQQIKNDSSQYINA 
RKNNISFSIKREGSKITFECPNNHLIIIQDLFRETILNLEKITKEVETVSLRAKKLDYSINYDKILSNINLNKRIK 
KENIILELKSSNKADVIRELLSVINIEIDKERIFQDLMEREKLITTALKEGFAIPHLKTNLISKIHIAIGISHEGI 
DFKALDKNLSHVFILILCPAKDYVSYPRILASWGKVDLYKKEILNAKTDKEIYNIIVSZ 

f469.nt 

ATGGCAAATGTAGCATTATCTTCAGGATTTATTAGCCAAAAAATATTTGGAATCATAATAATAATGGTGTTTTTGC 
CAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAACTCCCAAT 
AGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTTTAAAAAT 
GAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGAAAAAACA 
ATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTATAATACA 
AGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAGAGCAAAA 
AAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAAAAAAGGAAAACA 
TTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCTAAGCGTAATAAACATTGAAATTGA 
TAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTTTGCCATT 
CCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGACTTTAATG 
CTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGATTACGTTAGCTACCCTAGAAT 
TTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAACAGATAAAGAAATTTAT 
AATATAATAGTGAGCTAA 

t469 .nt 

TTTTTGCCAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAAC 
TCCCAATAGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTT 
TAAAAATGAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGA 
AAAAACAATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTA 
TAATACAAGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAG 
AGCAAAAAAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAAAAAAG 
GAAAACATTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCTAAGCGTAATAAACATTG 
AAATTGATAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTT 
TGCCATTCCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGAC 
TTTAATGCTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGATTACGTTAGCTACC 
CTAGAATTTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAACAGATAAAGA 
- AATTTAT AATATAATAGTGAGCTAA 

f477 .aa 

MEKPQGVSIVGAISGAMHVHLMAEHYGVPWLHTDHCAKNLLPWVEGLLEYGEKYYSQHKKPLFSSHMLDLSEEPI 
KEMIEISKKFLERMAKIEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVH 
GVYKPGNVKLTPK^/LKDGQDWISKTGVTSIMAKPVSWFHGGSGSTIDEINEALSYGWKMNIDTDTQWAAWEGVLN 
YYKKNESRLQGQLGDGKDIDIPNKKFYDPRWJLREAEVSMKDRWIACKNLNNINRNZ 

t477.aa 

MBTVHLMAEHYGVPVVLHTDHCAKNLLPWVEGLLEYGEKYYSQHKKPLFSSHMLDLSEEPIKENIEISKKFLERMAK 
IEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVHGVYKPGNVKLTPKVLK 
DGQDYVISKTGVNMAKPVSYVFHGGSGSTIDEINEALSYGVVKMN'IDTDTQWAAWEGVLNYYKKNESRLQGQLGDG 
KDIDIPNKKFYDPRVWLREAEYSMKDRVKIACKNLNNINRNZ 

f477.nt 

ATGGAAAAACCACAAGGAGTTTCAATAGTTGGAGCTATTTCTGGTGCTATGCATGTTCATTTAATGGCAGAGCATT 
. ATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGCTTCCTTGGGTTGAAGGCCTTTTAGAATA 
TGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACATATGTTAGATTTATCAGAAGAACCTATT 
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TABLE 1. Nucleotide and Amino Acid Sequences 

AAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAATGGCAAAAATTGAAATGTTTTTGGAAATAGAGCTTG 
GAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTTTGCATGAACTATTTTCTACTCCTGAGGA 
TATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCAGATTGCAGCAGCTTTTGGAAATGTTCAT 
GGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAAGATGGTCAAGATTATGTCATATCAAAAA 
CAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGTCTGGATCTACAATTGATGAGATTAATGA 
GGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACAGATACACAGTGGGCTGCCTGGGAGGGTGTTTTAAAT 
TATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGCAAGGATATTGATATTCCAAATAAGAAAT 
TTTATGATCCAAGGC-TTTGGTTAAGAGAAGCTGAAGTTTCTATGAAAGACCGTGTGAAGATTGCATGCAAAAATCT 
TAATAATATTAATAGAAATTAA 



ATGCATGTTCATTTAATGGCAGAGCATTATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGC 
TTCCTTGGGTTGAAGGCCTTTTAGAATATGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACA 
TATGTTAGATTTATCAGAAGAACCTATTAAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAATGGCAAAA 
AfTGAAATGTTTTTGGAAATAGAGCTTGGAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTT 
TGCATGAACTATTTTCTACTCCTGAGGATATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCA 
GATTGCAGCAGCTTTTGGAAATGTTCATGGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAA 
GATGGTCAAGATTATGTCATATCAAAAACAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGT 
CTGGATCTACAATTGATGAGATTAATGAGGCGCTTTCTTATGGCGTTGTAAAGATGAA'TATTGACACAGATACACA 
GTGGGCTGCCTGGGAGGGTGTTTTAAATTATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGC 
AAGGATATTGATATTCCAAATAAGAAATTTTATGATCCAAGGGTTTGGTTAAGAGAAGCTGAAGTTTCTATGAAAG 
ACCGTGTGAAGATTGCATGCAAAAATCTTAATAATATTAATAGAAATTAA 



MPSSFPFLLVNGSSGIAVGRUVTNMASHI^REICDAIVYMIjDNENASIFDLLKIVKGPDFPTFGEIVYNDNLIKAYK 
TGKGSVVIRARYHIEERAEDRNAIIVTEIPYTVWKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGF 
DPHVIMNLLYEYTEFKKHFSINNLALVNGIPKQLNLEELLFEFIEHRKNIIERRIEFDLRKAKEKAHVLEGLNIAL 
NNIDEVIKIIKSSKLAKDARERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVR 
IINIIREETINLGLKFGDERRTKIIYDEEVLKTSMSDLMQKENIWMLTKKGFLKRLSQNEYKLQGTGGKGLSSFD 
LNDGDEIVIALCVNTHDYLFMISNEGKLYLINAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLL 
LTTASGKIARFESTDFKAVKSRGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCG 
MKLKEGDLFVKVLSVKENPYLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSVVDAIAVSEDDEILLV 
SKRSKALRTVAGKVSEQGKDARGIQVLFLDNDSLVSVSKFIKZ 



i^TMXIAPHNLREICDAIVYMLDNENASIFDLLKIVKGPDFPTFGEIVYNDNLIKAYKTGKGSVVIRARYHIEERAE 
DRNAIIVTEIPYTVNKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGFDPHVIMNLLYEYTEFKKHF 
SIJSWLALVNGIPKQLNLEELLFEFIEHRKNIIERRIEFDLRKAKEKAHVLEGLNIALNNIDEVIKIIKSSKLAKDA 
RERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVRIINIIREETINLGLKFGDE 
RRTKIIYDEEVLKTSMSDLMQKENIWMLTKKGFLKRLSQNEYKLQGTGGKGLSSFDLNDGDEIVIALCVNTHDYL 
FMISNEGKLYLINAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLLLTTASGKIARFESTDFKAV 
KSRGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCGMKLKEGDLFVKVLSVKENP 
YLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSWDAIAVSEDDEILLVSKRSKALRTVAGKVSEQGK 
DARGIQVLFLDNDSLVSVSKFIKZ 



ATGCCGTCATCATTTCCATTTCTTTTGGTAAATGGCTCTAGTGGAATTGCTGTTGGAATGGCTACTAATATGGCAC 
CTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATGCTTCTATATTTGATTTGCT 
T AAAATAGTTAAAGGGCC TGATTTCC C AAC TTTTGGAGAGATTGTTT ATAATGATAATTTAATTAAAGC AT AC AAA 
ACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAAGATAGAAATGCTATAATTG 
TTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTTTAGCAAAAGAAGAAAAGCT 
AGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCTTGAAGTTAAAAGAGGATTT 



t47 7 .nt 



f488 .aa 



t488 . aa 



f488 .nt 
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GATCCTCATGTTATTATGAATTTGCTTTATGAATATACTGAATTTAAAAAGCATTTTAGTATAAATAATTTAGCCC 
TTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTGAGCATAGAAAAAATATTAT 
CGAAAGACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGAGGGATTAAATATTGCTTTA 
AATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCAAGGGAGAGGCTTGTTTCGA 
ATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAACTTACAGCCCTTGAGATTTT 
TAAGCTTGAAGAGC-AGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATATTCTCTTGAATCCAGTAAGG 
ATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAACGTCGAACTAAAATAATTT 
ATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTGTTGTTATGCTTACAAAGAA 
AGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGGAAAAGGACTAAGTTCGTTTGAT 
CTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTATTTATGATTTCAAATGAAG 
GAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAGGTCAGAATATTAGTGAGCT 
TATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAACTGATGATGCTTATTTATTG 
CTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGTAAAGTCACGAGGTGTTATTG 
TTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATGAAAAAGTAATTTGTCTTTC 
TAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTACTAATAGAGGTACCCAAGGTGTTTGTGGA 
ATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCTTATCTTTTGATTGTTTCTG 
AAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGAGGAGCCACTGGTTATACTAGTTA 
TAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGCAGTTTCAGAGGATGATGAAATCTTGCTTGTA 
AGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAAGATGCTAGAGGAATTCAAG 
TATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 



ATGGCTACTAATATGGCACCTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATG 
CTTCTATATTTGATTTGCTTAAAATAGTTAAAGGGCCTGATTTCCCAACTTTTGGAGAGATTGTTTATAATGATAA 
TTTAATTAAAGCATACAAAACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAA 
GATAGAAATGCTATAATTGTTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTT 
TAGCAAAAGAAGAAAAGCTAGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCT 
TGAAGTTAAAAGAGGATTTGATCCTCATGTTATTATGAATTTGCTTTATGAATATACTGAATTTAAAAAGCATTTT 
AGTATAAATAATTTAGCCCTTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTG 
AGCATAGAAAAAATATTATCGAAAGA.CGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGA 
GGGATTAAATATTGCTTTAAATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCA 
AGGGAGAGGCTTGTTTCGAATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAAC 
TTACAGCCCTTGAGATTTTTAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATAT 
TCTCTTGAATCCAGTAAGGATTATfAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAA 
CGTCGAACTAAAATAATTTATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTG 
TTGTTATGCTTACAAAGAAAGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGGAAA 
AGGACTAAGTTCGTTTGATCTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTA 
TTTATGATTTCAAATGAAGGAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAG 
GTCAGAATATTAGTGAGCTTATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAAC 
TGATGATGCTTATTTATTGCTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGTA 
AAGTCACGAGC-TGTTATTGTTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATG 
AAAAAGTAATTTGTCTTTCTAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTAGTAATAGAGG 
TACCCAAGGTGTTTGTGGAATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCT 
TATCTTTTGATTGTTTCTGAAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGA.GGAG 
CCACTGGTTATACTAGTTATAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGCAGTTTCAGAGGA 
TGATGAAATCTTGCTTGTAAGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAA 
GATGCTAGAGGAATTCAAGTATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 

f 494 .aa 

MFALIRKIFMIYFLCITLAGFAMIFIDSKFTEQP1WKENQSKINQHTIEPNLIMFTSSIGGFLGVYVGIWIFNYDK 
SNFYLNWGNLIILIYNIALIITVYSKSHS " 



t494 .aa 



TABLE 1. Nucleotide and Amino Acid Sequences 

MIFIDSKFTEQPNVKENQSKINQHTIEPNLIMFTSSIGGFLGVYVGIWIFNYDKSNFYLNWGNLIILIYNIALIIT 
VYSKSHS 

f 494 .nt 

ATGTTTGCATTAATTAGAAAAATATTTATGATCTATTTTTTATGCATTACTCTTGCAGGTTTTGCCATGATTTTTA 
TTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATACAATTGAACCCAA 
TTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATTTGGATCTTTAACTATGACAAA 
AGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACTGTATACTCAA 
AAT C AC AT AGTT AG 

t494.nt 

ATGATTTTTATTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATACAA 
TTGAACCCAATTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATTTGGATCTTTAA 
CTATGACAAAAGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACT 
GTATACTCAAAATCACATAGTTAG 

f516.aa 

MKKTPNTCIFLTLLIISNLNALANEEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYY 
TFSNNTFDFLIVAMILTRESLNIPKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYE 
KLPYVIEPYMFIEISSKKAIPLMGLDFKIDFLFLDTFNISFNFTIRYNFKDKNEMET 

t516 . aa 

NEEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYYTFSNNTFDFLIVAMILTRESLNI 
PKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYEKLPYVIEPYMFIEISSKKAIPLM 
GLDFKIDF LFLDT FNI S FNFT IRYNFKDKNEMET 

f51-6.nt 

ATGAAAAAAACTCCAAACACTTGTATTTTCTTAACATTGCTTATCATTTCCAATTTAAATGCACTTGCAAATGAAG 
AAGGCAATACTAATGAAAAAAATGATCAACCCAAACAAATCTCTAATTTTTTTAGCCCAGAAAGAGGGTTCATATA 
TTCAACAGGAATTGGGATTGGAGTTGGATTTTTTCTAAATTCAAATATTAAACACCTTATCTTTAGACCTTATTAT 
ACATTCTCTAATAATACTTTTGATTTTTTAATCGTTGCTATGATATTAACAAGGGAAAGCCTTAATATCCCCAAAA 
AAATGCAATACTTTAAATCTTATATTGGAGGAGGAATAAACTGGCACATTGCAAACTTAATTAAAAAAACAAAATA 
TTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACCTATCTACAAACTTTATAGAAGACATTCGATTTTACGAA 
AAATTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTTCTTCTAAAAAGGCAATTCCTTTAATGGGGTTAG 
ACTTTAAAATTGATTTTTTATTTTTAGATACATTTAACATTTCTTTTAATTTTACTATTAGATATAATTTTAAGGA 
CAAAAACGAGATGGAAACATGA 

t516.nt 

AATGAAGAAGGCAATACTAATGAAAAAAATGATCAACCCAAACAAATCTCTAATTTTTTTAGCCCAGAAAGAGGGT 
TCATATATTCAACAGGAATTGGGATTGGAGTTGGATTTTTTCTAAATTCAAATATTAAACACCTTATCTTTAGACC 
TTATTATACATTCTCTAATAATACTTTTGATTTTTTAATCGTTGCTATGATAT'TAACAAGGGAAAGCCTTAATATC 
C C C AAAAAAATGC AAT AC T TTAAATC TT AT ATTGGAGGAGGAAT AAACTGGC AC ATTGC AAAC TTAATT AAAAAAA 
CAAAATATTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACCTATCTACAAACTTTATAGAAGACATTCGATT 
TTACGAAAAATTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTTCTTCTAAAAAGGCAATTCCTTTAATG 
GGGTTAGACTTTAAAATTGATTTTTTATTTTTAGATACATTTAACATTTCTTTTAATTTTACTATTAGATATAATT 
TT AAGG AC AAAAAC GAG AT GGAAAC ATGA 



f517 .aa 
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TABLE 1. Nucleotide and Amino Acid Sequences 

MIPWASGGILIALSIAFVGIGPDGPNFAEHPFYKQIADIGSIAFGMMLPVLAGFIAMAIADKPGLTPGLVGGVMS 
G1WKAGFLGAIFAGFLAGYVARFLARRSVPEWLRPVMPIFVIPLISTIIVGFFMLYFGVYIGKFMGVLESGLKSLQ 
SNSETFGVLGKIFLGLVLGSMITVDMGGPFNKVAFLFGVGLIPQVPEIMGMVAAAIPVPPMAMGLATFLAPKLFEK! 
EEKESGKIAFLISFIGISEGAIPFAASDPGRVIPSIWGGAVSSIIAAFLGVANHAPHGGPIVLPVIDNKFGFIIA 
IAVGVAVATALVIFLKSLKLKESE 



DKPGLTPGLVGGVMSGNVKAGFLGAIFAGFLAGYVARFLARRSVPEWLRPVMPIFVIPLISTIIVGFFMLYFGVYI 
GKFMGVLESGLKSLQSNSETFGVLGKIFLGLVLGSMITVDMGGPFNKVAFLFGVGLIPQVPEIMGMVAAAIPVPPM 
AMGLATFLAPKLFENEEKESGKIAFLISFIGISEGAIPFAASDPGRVIPSIWGGAVSSIIAAFLGVANHAPHGGP 
IVLPVIDNKFGFIIAIAVGVAVATALVIFLKSLKLKESE 



ATGATTCCTGTTGTTGCAAGTGGAGGAATTTTAATTGCTCTTAGCATTGCTTTTGTTGGGATTGGACCTGATGGGC 
CTAATTTTGCTGAGCATCCATTTTATAAGCAGATTGCAGATATTGGTTCTATAGCTTTTGGGATGATGTTGCCCGT 
GGTTGCTGGTTTTATTGCAATGGCAATTGCTGATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCT 
GGGAATGTAAAAGCAGGTTTCTTGGGCGCAATATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAA 
GAAGATCTGTTCCTGAGTGGTTAAGACCTGTAATGCCTATATTTGTAATTCCGCTAATAAGCACGATTATTGTCGG 
CTTTTTTATGCTGTATTTTGGTGTTTATATTGGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAG 
AGTAATTCGGAAACTTTTGGCGTGTTGGGTAAAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATA 
TGGGCGGACCTTTTAATAAAGTGGCATTTCTTTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAAT 
GGTAGCAGCAGCAATTCCTGTTCCTCCTATGGCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAAT 
GAAGAAAAAGAATCTGGTAAAATAGCCTTTTTAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTG 
CTAGTGATCCCGGACGGGTAATCCCTTCGATAGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGG 
CGTTGCTAATCATGCTCCACACGGAGGACCAATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCA 
ATTGCTGTTGGAGTTGCGGTTGCAACAGCTTTGGTAATTTTTTTGAAATCTTTAAAATTAAAGGAATCTGAATGA 



GATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCTGGGAATGTAAAAGCAGGTTTCTTGGGCGCAA 
TATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAAGAAGATCTGTTCCTGAGTGGTTAAGACCTGT 
AATGCCTATATTTGTAATTCCGCTAATAAGCACCATTATTGTCGGCTTTTTTATGCTGTATTTTGGTGTTTATATT 
GGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAGAGTAATTCGGAAACTTTTGGCGTGTTGGGTA 
AAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATATGGGCGGACCTTTTAATAAAGTGGCATTTCT 
TTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAATGGTAGCAGCAGCAATTCCTGTTCCTCCTATG 
GCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAATGAAGAAAAAGAATCTGGTAAAATAGCCTTTT 
TAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTGCTAGTGATCCCGGACGGGTAATCCCTTCGAT 
AGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGGCGTTGCTAATCATGCTCCACACGGAGGACCA 
ATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCAATTGCTGTTGGAGTTGCGGTTGCAACAGCTT 
TGGT AATTTTTTT GAAAT C TTT AAAATT AAAGGAAT CTG AAT GA 



MIKIFKKIYILTLVLGMAHLSFASDNYMVRCSKEEDSTTCIAKLKEIKEKKNYDLFSMGIGIGDPIANIMITIPYI 
NIDFGYGGFIGLKSNNFENYLNGGIDVIFKKQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPF 
IMEYNFAKNLSIGFKIYPAVGPTILLTKPSILFEGIKFNFFGFGFIKFAFN 



t517.aa 



f517 .nt 



tS17 .nt 



f519 .aa 



t519 .aa 



DNYMVRCSKEEDSTTCIAKLKEIKEXKNYDLFSMGIGIGDPIANIMITIPYINIDFGYGGFIGLKSNNFENYLNGG 
IDVIFKKQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPFIMEYNFAKNLSIGFKIYPAVGPTI 
LLTKPS I LFEGIKFNFFGFGF IKFAFN 




TABLE 1 . Nucleotide and Amino Acid Sequences 
f519.nt 

ATGATAAAAATTTTTAAAAAAATATACATTTTAACATTAGTATTAGGTATGGCACACCTTTCTTTTGCATCTGACA 
ATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAAAGAA 
AAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTATATA 
AATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTATCTAAATGGTGGAATAG 
ACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGTCAAA 
AACATCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGCTGTTATAAGAATTCCTTTT 
ATAATGGAATATAATTTTGCAAAAAATTTATCCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATATTAC 
TAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCATTTAA 
TTAA 

t519.nt 

GACAATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAA 
AGAAAAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTA 
TATAAATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTAT'CTAAATGGTGGA 
ATAGACGTTATTTTTAAAAAGCAAATTGGACAAIATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGT 
CAAAAACATCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGCTGTTATAAGAATTCC 
TTTTATAATGGAATATAATTTTGCAAAAAATTTATCCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATA 
TTACTAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCAT 
TTAATTAA 

f520.aa 

MRMLLATIILILTTGLLAAQSKSKSMTEDDFDFDKLLAKEESVRRLFGIGFGVGYPLANITISVPYVDIDLGYGGF 
VGLKPNWFLPYVVMGVDLLFKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEEENEAQQVASLQNRIGWIRL 
PLVIEYSFLKNIVIGFKAVATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

t520.aa 

QSKSKSMTEDDFDFDKLLAKEESWRLFGIGFGVGYPLANITISVPYVDIDLGYGGFVGLKPNNFLPYWMGVDLL 
FKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEEENEAQQVASLQNRIGWIRLPLVIEYSFLKNIVIGFKAV 
ATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

f520 .nt 

ATGAGAATGCTATTAGCAACAATAATACTTATATTAACAACGGGTTTATTAGCTGCACAATCCAAAAGCAAAAGTA 
TGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTGCGCCGTTTATTTGGCATAGGTTT 
TGGAGTTGGATATCCACTTGCAAACATTACAATATCTGTTCCATATGTAGACATAGACCTTGGGTACGGAGGATTC 
GTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTATTTAAAGATGAAATACACA 
AAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAGATTGGTCAAAAGGAAGTCCTGAAAAATCAAATGAAAA 
ACTTGAAGAAGAGGAAGAAAATGAAGCACAACAAGTAGCTTCTCTTCAAAATAGAATAGGGGTTGTGATAAGATTG 
CCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTGTGATTGGATTTAAAGCTGTTGCTACTATTGGAACAACTA 
TGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAGGCTTTATAAAAATATATAT 
ATAG 

t52 0 .nt 

CAATCCAAAAGCAAAAGTATGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTGCGCC 
GTTTATTTGGCATAGGTTTTGGAGTTGGATATCCACTTGCAAACATTACAATATCTGTTCCATATGTAGACATAGA 
CCTTGGC-TACGGAGGATTCGTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTA 
TTTAAAGATGAAATACACAAAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAGATTGGTCAAAAGGAAGTC 
CTGAAAAATCAAATGAAAAACTTGAAGAAGAGGAAGAAAATGAAGCACAACAAGTAGCTTCTCTTCAAAATAGAAT 
AGGGGTTGTGATAAGATTGCCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTGTGATTGGATTTAAAGCTGTT 
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TABLE 1. Nucleotide and Amino Acid Sequences 

GCTACTATTGGAACAACTATGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAG 
GC TTT ATAAAAATAT AT AT AT AG 



MNIKINFFFTLPIGIFLGLFFPLGIYSSLSHAFIRLSYLSLIPFLIFSIPLGIENIIENKNFKKLFGKTIYYGILT 
NLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFTSSNPNLLSIYMISIIIGTSFYYAK 
QKGRIARELMLSASNLFYHANGFIWILNIGIIFITAiSrYAANLKNFKDYPNYTNSITFFLAWTIIILFVILPTISY 
RLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSIIINIPLINFVSKFGTIFVSVISFFI 
ILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIELNVSNITPMLPILISLALLIDFAF 
NIAI IHI INFKSLKDQEKIN 



IENIIENKNFKKLFGKTIYYGILTNLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFT 
SSNPNLLSIYMISIIIGTSFYYAKQKGRIARELMLSASNLFYHANGFIVNILNIGIIFITANYAANLKNFKDYPNY 
TNSITFFLAWTIIILFVILPTISYRLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSII 
INIPLINFVSKFGTIFVSVISFFIILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIE 
LNVSNITPMLPILISLALLIDFAFNIAIIHIINFKELKDQEKIN 



ATGAATATAAAAATCAATTTTTTTTTCACTTTGCCTATTGGAATCTTTTTAGGATTGTTTTTCCCTCTTGGAATTT 
ATAGCTCCTTATCACATGCTTTTATAAGATTATCATACTTATCTCTTATTCCCTTTTTAATATTTTCAATTCCATT 
AGGAATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAAAACAATTTATTATGGAATTTTAACT 
AACCTATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAA 
CAATACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAAAAAATATTTTCAAAATATT 
TACATCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAA 
CAAAAAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAA 
ACATATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAAA 
TTATACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTAT 
AGATTAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTG 
CAAAAGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCAT 
AATTATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTATA 
ATTTTAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTG 
TCTTTGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAAT 
AGAGCTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTT 
AACATTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAAAAAATTAATTAA 



ATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAAAACAATTTATTATGGAATTTTAACTAACC 
TATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAACAAT 
ACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAAAAAATATTTTCAAAATATTTACA 
TCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAACAAA 
AAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAAACAT 
ATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAAATTAT 
ACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTATAGAT 
TAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTGCAAA 
AGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCATAATT 
ATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTATAATTT 
TAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTGTCTT 
TGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAATAGAG 
CTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTTAACA 
TTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAAAAAATTAATTAA 



f 523 . aa 



t523 .aa 



f523 .nt 



£523 .nt 
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TABLE 1 . Nucleotide and Amino Acid Sequences 
f526.aa 

MKKEFIMLLLLLQTIMNLNSINTNTSTSIVKELQKNLYIFNSKEYQKDKDTLNEFINSININDKEILQSLEKIKNH 
LFIISVFFNNKKGILIALNLGAEINFKYKISPISISIINNEFEITKILIDYGISLNQIDDTGYSPIFWAIYTNNEK 
IFEFLKESGADLSFTLKNRKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

t526.aa 

NSINTNTSTSIVKELQKNLYIFNSKEYQKDKDTLNEFINSININDKEILQSLEKIKNELFIISVFFNNKKGILIAL 
NLGAEINFKYKISPISISIINUEFEITKILIDYGISLNQIDDTGYSPIFWAIYTNNEKIFEFLKESGADLSFTLKN 
RKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

f526 .nt 

ATGAAAAAAGAATTCATTATGCTTTTACTGTTATTGCAAACAATAATGAATTTAAACTCAATAAATACTAATACAA 
GTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTCAATAGCAAAGAATATCAAAAAGATAAAGACAC 
TTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAATCTTACAAAGTTTAGAAAAAATCAAAAATGAG 
CTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTAAATCTTGGAGCAGAAATAAACT 
TTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACAATGAATTTGAAATCACAAAAATATTGATAGATTA 
CGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGCAATATATACTAATAACGAAAAA 
ATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAATAGAAAAACACCAATGCAAGCCG 
CAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAAAAAATTTACATTGACGACAATTTCAAAAA 
AAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAATAG 

t526 .nt 

AACTCAATAAATACTAATACAAGTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTCAATAGCAAAG 
AATATCAAAAAGATAAAGACACTTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAATCTTACAAAG 
TTTAGAAAAAATCAAAAATGAGCTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTA 
AATCTTGGAGCAGAAATAAACTTTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACAATGAATTTGAAA 
TCACAAAAATATTGATAGATTACGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGC 
AATATATACTAATAACGAAAAAATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAAT 
AGAAAAACACCAATGCAAGCCGCAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAAAAAATTT 
ACATTGACGACAATTTCAAAAAAAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAATAG 

f544.aa 

MTKNRIIWLLVLMVSSTFTATIISNYQNLMLSLWLANFIPLLIxmTSGNAGSQASALIIRELALGTVKVKDFFKVF 
LKEICVSILVGAILASWFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPL 
I TT I ADAI T L I AYFNI AKWVL VS YAV 

t544.aa 

STFTATIISNYQNLMLSLWLANFIPLLMDTSGNAGSQASALIIRELALGTVKVKDFFKVFLKEICVSILVGAIIA 
SVNFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPLITTIADAITLIAYFN 
IAKWVLVSYAV 

f544.nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTGTCTTCTACTTTTACAGCTACAATTATTTCAAATT 
ATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTCCCCTTTTAATGGATACTTCAGGCAATGCCGG 
CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGTTT 
TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTTTG 
TAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTAGC 
AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTTTA 




TABLE 1. Nucleotide and Amino Acid Sequences 

ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

" t544 .nt 

TCTACTTTTACAGCTACAATTATTTCAAATTATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTC 
CCCTTTTAATGGATACTTCAGGCAATGCCGGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTAC 
TGTCAAGGTAAAAGATTTTTTTAAAGTGTTTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCT 
AGTGTTAATTTTTTAAGAATTGTCTTTTTTGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTT 
CATCTTGCTTGATGGTAAGTTTGACAGTAGCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAA 
GTTGGATCCAGCACTTATGGCAGGCCCTTTAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAAT 
ATAGCAAAATGGGTTTTAGTTAGCTATGCTGTTTAA 

f545 .aa 

MTKNRIIWLLVLMVSSTFTATIISINri'QNLMLSLVVLANFIPLLMDTSGNAGSOASALIIRELALGTVKVKDFFKVF 
LKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPL 
ITTIADAITLIAYFNIAKWVLVSYAV 

• t545 .aa 

GSQASALIIRELALGTVKVKDFFKVFLKEICVSILVGAILASWFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTV 
' AKILGGLLPIVAKLLKLDPALMAGPLITTIADAITL.IAYFNIAKWVLVSYAV 

f545 .nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTGTCTTCTACTTTTACAGCTACAATTATTTCAAATT 
ATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTCCCCTTTTAATGGATACTTCAGGCAATGCCGG 
CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGTTT 
TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTTTG 
TAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTAGC 
AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTTTA 
ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

t545.nt 

GGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGT 
TTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTT 
TGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTA 
GCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTT 
TAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGC 
TGTTTAA 

f5 77 .aa 

MRIKNLILIAILLISPSCSTNK3SJIWLTDNKTIPFYINQFNISNKANFIIKFRNNIDLQTIEKENAQIIISKNIGN 
TNIANHFKSVKINYNPDYPILKHIFKQFNYKIIPLGFDIPILIYKNTHHIKKYINTKYLKEEYENFIKDGKFFISP 
WSENLFWISQIMWRFSFEKNKLimJENQILKMLEYFSSFLNTKQMDLQKDFFNKYGYLKLNKILLNKKSLLIA 
GLSDITFYNSLSEQEKSQIKFSYLINDNNEIVISNPNFIGILETSVLTKKFINWILYKKTQKTLIGFNNQSQSNIC 
FGFANGFTPYKELNLKIKHSIDGISPFIIDETQINSHSWLSKKTIEKENLLINEWFFSKANNLKKNKN 

t577.aa 



NKNIVVLTDNKTIPFYINQFNIENKANFIIKFRKNIDLQTIEKENAQIIISKlSriGISrTNIAiJHFKSVKINYNPDYPI 
LKHIFKQFNYKIIPLGFDIPILIYKNTHHIKKYINTKYLKEEYENFIKDGKFFISPYVSENLFYVISQINNVRFSF 




TABLE 1. Nucleotide and Amino Acid Sequences 

EKNKLNYNENQILKMLEYFSSFLNTKQMDLQKDFFNKYGYLKLNKILLNKKSLLIAGLSDITFYNSLSEQEKSQIK 
FSYLINDNNEIVISNPNFIGILETSVLTKKFINWILYKKTQKTLIGFNNQSQSNICFGFANGFTPYKELNLKIKHS 
IDGISPFIIDETQINSHSWLSKKTIEKENLLINEWFFSKANNLKKNKN 

f577.nt 

ATC-AGAATAAAAAATTTAATACTAATAGCAATTTTATTAATTAGCCCTAGCTGTTCAACAAATAAGAACATCGTTG 
TACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAGCAAATTTTATAATTAA 
GTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTCTAAAAACATTGGTAAC 
ACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATCTTAAAGCATATTTTCA 
AGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAAATACACATCATATTAA 
AAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAAATTTTTTATATCGCCT 
TATGTTTCTGAAAATTTA.TTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTTGAAAAAAATAAATTAA 
ATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACAAAACAAATGGACTTGCA 
AAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAAAATCTCTTTTAATAGCA 
GGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAATTTTCCTATTTAATAA 
ACGATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTGGCATTTTAGAAACATCTGTTTTAACTAAAAAATT 
TATCAACTGGATATTGTATAAAAAAACTCAAAAAACCCTAATTGGATTTAACAATCAATCCCAATCAAATATATGT 
TTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCAATTGATGGAATATCTC 
CTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATTAAGCAAAAAAACAATTGAAAAAGAAAACTT 
ACTAATAAATGAATGGTTTTTCTCTAAAGCTAATAATCTAAAAAAAAATAAAAATTAA 

t577 .nt 

AATAAGAACATCGTTGTACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAG 
CAAATTTTATAATTAAGTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTC 
TAAAAACATTGGTAACACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATC 
TTAAAGCATATTTTCAAGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAA 
ATACACATCATATTAAAAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAA 
ATTTTTTATATCGCCTTATGTTTCTGAAAATTTATTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTT 
GAAAAAAATAAATTAAATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACAA 
AACAAATGGACTTGCAAAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAAA 
ATCTCTTTTAATAGCAGGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAA 
TTTTCCTATTTAATAAACGATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTGGCATTTTAGAAACATCTG 
TTTTAACTAAAAAATTTATCAACTGGATATTGTATAAAAAAACTCAAAAAACCCTAATTGGATTTAACAATCAATC 
CCAATCAAATATATGTTTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCA 
ATTGATGGAATATCTCCTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATTAAGCAAAAAAACAA 
TTGAAAAAGAAAACTTACTAATAAATGAATGGTTTTTCTCTAAAGCTAATAATCTAAAAAAAAATAAAAATTAA 

f584.aa 

MIKTILLLVLYPVVVFSQISANQYFEGIYAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFIINLDSNNQVFVS 
DGEFLTVYVPSLGTSFNQQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYKGAATINS 
FIIAFAPDGIIRRITAFPTSGGREIVIDLTAVKFNVGILDSKFKYDPPKSSNKVDNFLYDIKKN 



t584.aa 

QISANQYFEGIYAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFIINLDSNNQVFVSDGEFLTVYVPSLGTSFN 
QQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYKGAATINSFIIAFAPDGIIRRITAF 
PTSGGRE I VIDLTAVKFNVGI LDS KFKYDP PKS SNKVDNFLYDI KKN 

f 584 .nt 



ATGATAAAAACAATACTTTTATTAGTTTTGTATCCTGTTGTTGTGTTTTCTCAAATATCTGCAAATCAATATTTTG 
AAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTAATTTTACTTTAAAGGGGTTAAAGCA 




TABLE 1 . Nucleotide and Amino Acid Sequences 

AACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGATTCAAATAATCAAGTTTTTGTAAGT 
GATGGTGAATTTTTGACAGTTTATGTTCCATCTCTTGGGACTTCTTTTAATCAGCAATTATTAAAGGGTAGTAGTG 
GGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCAATTCTCCAAATTTAGAAGATCTCGA 
TTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTACAAGGGGGCTGCTACTATTAATTCT 
TTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTTCCTACTAGTGGTGGGCGCC-AAATAG 
TTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAATTTAAATATGATCCTCCAAAATCTTC 
AAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

t584 .nt 

CAAATATCTGCAAATCAATATTTTGAAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTA 
ATTTTACTTTAAAGGGGTTAAAGCAAACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGA 
TTCAAATAATCAAGTTTTTGTAAGTGATGGTGAATTTTTGACAGTTTATGTTCCATCTCTTGGGACTTCTTTTAAT 
CAGCAATTA.TTAAAGGGTAGTAGTGGGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCA 
f „ t ATTCTCCAAATTTAGAAGATCTCGATTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTA 
^ CAAGGGGGCTGCTACTATTAATTCTTTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTT 
CCTACTAGTGGTGGGCGCGAAATAGTTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAAT 
klJ TTAAATATGATCCTCCAAAATCTTCAAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

; 

Q f596.aa 

jh mkerclyllvfvalcvnmlfsddyliydfdlslneflevstrkdnlepmvdsnrillfyppkkeirkifaafdfdq 

□ yskkylfkknehgvffvkwiphgtssikyrlivdgvwtndeynknwynedlipfskieiakekssyislrnpiq 

?! sydimeieifyigrpgqivtiagsfnnfnpflimliekednkgiytiklknlpkdriyyyfidsgnkvidki^vnr 

;;;;;| inlyfvegidnkidfevsyfdhk 

«;! 

[J J t596.aa 

Q DDYL IYDFDLSLNEFLEVSTRKDNLEPMVDSNRILLFYPPKKEIRKIFAAFDFDQYSKKYLFKKNEHGVFFVKWI 

Q phgtssikyr li^gvt^tndeynk^ivvynedlipfskieiakekssyislrnpiqsydnneieifyigrpgqivti 
agsfnnfnpflnfxiekednkgiytiklknlpkdriyyyfidsgnkvidknnwrinlyfvegidnkidfevsyfd 

HK 

£596. nt 

atgaaagaaaggtgtttgtatttattggtttttgtagctttatgtgttaacaatcttttttcagatgattatttaa 
tttatgactttgatttaagtttaaatgaatttctagaagtttcaacaagaaaagacaatcttgagcctatggttga 
ttccaatcgtatattattgttttatcctcctaaaaaagaaattagaaaaatttttgctgcctttgactttgatcag 
tattctaagaaatatttattcaaaaaaaatgagcatggagttttttttgttaaagttaatattcctcatggcacaa 
gcagtataaaatataggcttattgtagacggtgtttggactaatgacgagtataataaaaatgtagtttataatga 
ggatttaatcccattttctaaaattgagatcgctaaagagaagtccagctatatttctttgagaaatccaatacaa 
tcatatgataacaatgaaattgaaattttttacataggtcgtcctggaca.\atagttacaatagctggtagtttta 
acaattttaatccttttttaaataggcttattgagaaagaggacaataagggaatttatactattaagcttaaaaa 

TTTACCCAAGGATAGAATTTATTATTATTTTATTGATTCTGGTAACAAAGTAATAGATAAAAATAATGTTAATAGA 

attaatttatattttgttgaggc-aattgataataaaatagatttcgaagtttcctattttgatcataagtaa 
t5 9-6.nt 

gatgattatttaatttatgactttgatttaagtttaaatgaatttctagaagtttcaacaagaaaagacaatcttg 
agcctatggttgattccaatcgtatattattgttttatcctcctaaaaaagaaattagaaaaatttttgctgcctt 
tgactttgatcagtattctaagaaatatttattcaaaaaaaatgagcatggagttttttttgttaaagttaatatt 
cctcatggcacaagcagtataaaatataggcttattgtagacggtgtttggactaatgacgagtataataaaaatg 
tagtttataatgaggatttaatcccattttctaaaattgagatcgctaaagagaagtccagctatatttctttgag 
aaatccaatacaatcatatgataacaatgaaattgaaattttttacataggtcgtcctggacaaatagttacaata 
gctggtagttttaacaattttaatccttttttaaataggcttattgagaaagaggacaataagggaatttatacta 
ttaagcttaaaaatttacccaaggatagaatttattattattttattgattctggtaacaaagtaatagataaaaa 
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TAATGTTAATAGAATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCGAAGTTTCCTATTTTGAT 
CATAAGTAA 



f598.aa 

MRQRVMIAMALSCHPSLLIADEPTTALDVTIQEQILLLIKNLSKKFWTSTIFITHDLAWAEICDTVSVMYQGKIV 
EEGTVEEIFNNPKHPYTIGLLKSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

t598 .aa 

EPTTALDVTIQEQILLLIKNLSKKFNTSTIFITHDLAWAEICDTVSVMYQGKIVEEGTVEEIFNNPKHPYTIGLL 
K3ILTLEHDPNKKLYSTKENPMKITKTSTEEF 

f598 .nt 

ATGAGACAAAGAGTTATGATTGCCATGGCTCTTAGCTGTCATCCATCCTTATTAATAGCAGATGAACCAACAACAG 
CCCTTGATGTTACAATCCAAGAGCAAATATTATTATTAATCAAAAACCTATCTAAAAAATTCAATACTTCTACCAT 
ATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATCTGTAATGTATCAAGGAAAAATTGTA 
GAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACACCATTGGGCTTTTAAAATCAATTCTTA 
CGCTAGAACACGATCCAAATAAAAAGCTTTATTCAACAAAAGAAAACCCTATGAAGATCACAAAAACCAGCACCGA 
GGAGTTTTAA 

t598 .nt 

GAACCAACAACAGCCCTTGATGTTACAATCCAAGAGCAAATATTATTATTAATCAAAAACCTATCTAAAAAATTCA 
ATACTTCTACCATATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATCTGTAATGTATCA 
AGGAAAAATTGTAGAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACACCATTGGGCTTTTA 
AAATCAATTCTTACGCTAGAACACGATCCAAATAAAAAGCTTTATTCAACAAAAGAAAACCCTATGAAGATCACAA 
AAACCAGCACCGAGGAGTTTTAA 

f600.aa 

MAIMERSIIGLFIALAFVSWLTVARWRGQVQSLSSSEFIQAAKTLGATNQRIILKHLIPNSIGMIVIFTTIRVPS 
FIMAEAFLSFLGLGISAPMTSWGELVQNGIATFVEYPWKVFIPAIVMTIFLLFMNFLGDGLRDAFDPKDSI 



C600 . aa 

RWRGQVQSLSSSEFIQAAKTLGATNQRIILKHLIPNSIGMIVIFTTIRVPSFIMAEAFLSFLGLGISAPMTSWGE 
LVQNGIATFVEYPWKVFIPAIVMTIFLLFMNFLGDGLRDAFDPKDSI 

foOO .nt 

ATGGCAATAATGGAAAGAAGTATAATCGGCTTATTCATAGCACTTGCATTTGTATCATGGTTAACAGTAGCTCGAG 
TTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAGCAGCCAAAACCCTTGGTGCAACAAATCA 
AAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCCAAGC 
TTTATTATGGCTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCAGCTCCAATGACAAGCTGGGGAGAATTAG 
TGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATATTTCT 
ATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

£6 00 .nt 

CGAGTTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAGCAGCCAAAACCCTTGGTGCAACAA, 
ATCAAAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCC 
AAGCTTTATTATGGCTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCAGCTCCAATGACAAGCTGGGGAGAA 
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TTAGTGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATAT 
TTCTATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

f 603 .aa 

MLKFTLKKILGIIPTLLVIIFLCFFVMRMAPGSPFDSEKPIDPQVKARLMEKYHLDKPFYIQAFYYITNALRGDLG 
PSLKKKDLTVSQYIKLGFPKSLTLGVISLIISLSIGIPIGILAAIYKNTYVDYIITSIAILGISIPLFVIGPILQY 
FFAIKWGLLYTSGWITERGGFSNLILPIITLSMPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIVIKHMLS 
GAMLPWSYIGPAFAAIISGSWIEKIFRIAGMGMFITESALNRDYPVLMGGLLVYSIILLISILISDIIYKILDP 
RV 

t603 .aa 

SPFDSEKPIDPQVKARLMEKYHLDKPFYIQAFYYITNALRGDLGPSLKKKDLTVSQYIKLGFPKSLTLGVISLIIS 
LSIGIPIGILAAIYKNTYVDYIITSIAILGISIPLFVIGPILQYFFAIKWGLLYTSGWITERGGFSNLILPIITLS 
MPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIVIKHMLRGAMLPWSYIGPAFAAIISGSWIEKIFRIAG 
MGMFITESALNRDYPVLMGGLLVYS I I LLI SILISDI IYKILDPRV 

f 603 .nt 

ATGTTAAAGTTTACTTTAAAGAAAATATTAGGAATAATACCAACTTTACTGGTAATAATTTTTTTATGCTTTTTTG 
TAATGAGAATGGCTCCTGGAAGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGA 
AAAATATCACCTTGACAAGCCTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGA 
CCTTCTTTGAAAAAGAAAGACCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAG 
TAATATCCCTTATTATATCACTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGT 
GGATTATATAATAACATCAATAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATAT 
TTTTTTGCAATTAAATGGGGTTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTC 
TACCCATAATAACTCTTAGCATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAAT 
ACAAAGCGACTTTATAAGAACTGCGCGTGCAAAAGGGCTAAGCTTCAAAAAGATAGTTATAAAGCATATGTTAAGA 
GGAGCAATGTTGCCTGTAGTAAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAA 
AAATATTTAGAATTGCTGGAATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGG 
CGGATTGTTAGTATATTCAATAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCA 
AGAGTATAA 

tS03.nt 

AGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGAAAAATATCACCTTGACAAGC 
CTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGACCTTCTTTGAAAAAGAAAGA 
CCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAGTAATATCCCTTATTATATCA 
CTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGTGGATTATATAATAACATCAA 
TAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATATTTTTTTGCAATTAAATGGGG 
TTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTCTACCCATAATAACTCTTAGC 
ATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAATACAAAGCGACTTTATAAGAA 
CTGCGCGTGCAAAAGGGCTAAGCTTCAAAAAGATAGTTATAAAGCATATGTTAAGAGGAGCAATGTTGCCTGTAGT 
AAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAAAAATATTTAGAATTGCTGGA 
ATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGGCGGATTGTTAGTATATTCAA 
TAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCAAGAGTATAA 

f 607 .aa 

MKYIKIALMLIIFSLIACISNAKKEXIVFRVSNLSEPSSLDPQLSTDLYGSNIITNLFLGLAVKDSQTGKYKPGLA 
KSWNISEDGIIYTFNLREDIVWSDGVAITAEEIKKSYLRILNKKTAAMYANLIKSTIKNAQEYFDETVPESELGIK 
AIDSKTLEITLTSPKPYFPDMLTHSAYIPVPMHIVEKYGENWTNPENIWSGAYKLKERSINDKIVIEKNEKYYNA 
KWEIDEVIFYPTEGSVAY3SIMYINGELDFLQGAEKNNLEEIKIRDDYYSGLKNGMAYIAFNTTIKPLDNLKVRQAI 
SLAIDRETLTKWLKGSSDPTRNLTPKFDDYSYGKNLILFDPENAKKLLAEAGYPDGKGFPTLKYKISEGRPTTAE 
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FLQEQFKKILNINLEIENEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALI 
KKSNFELDPIKRQDILRQAEEIIAEKDFPMA.PLYIPKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 

t607 .aa 

CISNAKKEKIVFRVSNLSEPSSLDPQLSTDLYGSNIITNLFLGLAVKDSQTGKYKPGLAKSWNISEDGIIYTFNLR 
EDIVWSDGVAITAEEIKKSYLRILNKKTAAMYANLIKSTIKNAQEYFDETVPESELGIKAIDSKTLEITLTSPKPY 
FPDMLTHSAYIPVPMHIVEKYGENWTNPENIVVSGAYKLKERSINDKIVIEKMEKYYNAK1TVEIDEVIFYPTEGSV 
AYNMYINGELDFLQGAEKNWLEEIKIRDDYYSGLKNGMAYIAFNTTIKPLDNLKVRQAISLAIDRETLTKVVLKGS 
SDPTRNLTPKFDDYSYGKNLILFDPENAKKLLAEAGYPDGKGFPTLKYKISEGRPTTAEFLQEQFKKILNINLEIE 
NEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALIKXSNFELDPIKRQDILR 
QAEEI IAEKDFPMAPLiYIPKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 

fSOT.nt 

ATGAAATATATAAAAATAGCCTTAATGCTAATAATTTTTTCTTTAATAGCATGTATTAGTAATGCTAAAAAAGAAA 
AAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCCTCAACTCTCAACAGACCTTTACGGTAG 
CAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAACTGGAAAATATAAACCAGGACTTGCA 
AAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGAGAAGATATAGTTTGGAGCGATGGAG 
TTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTAAATAAAAAAACAGCTGCAATGTATGCTAA 
TTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACAGTGCCTGAATCTGAGCTTGGCATAAAG 
GCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTATTTTCCTGATATGCTAACACACTCAG 
CATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGACAAATCCTGAAAATATAGTTGTTAG 
TC-GCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGAAAAAAATGAAAAATACTATAATGCA 
AAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTGGCTTACAATATGTACATAAACGGTG 
AACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAATAAGAGATGATTATTATTCTGGGTT 
AAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGACAATTTAAAAGTTAGACAAGCCATC 
TCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGTTCAGATCCAACAAGAAATCTAACTC 
CAAAATTTGATGATTATTCTTATGGAAAAAATTTAATACTATTTGATCCTGAGAATGCAAAAAAACTTTTAGCTGA 
AGCTGGATATCCGGATGGGAAAGGATTCCCCACATTAAAATATAAAATATCGGAGGGAAGACCAACAACAGCAGAA 
TTTTTGCAAGAACAATTTAAAAAAATACTAAACATTAACTTAGAAATCGAGAATGAAGAATGGACAACATTCCTAG 
GAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAGATTATTTTGATCCCTTAACATTCTT 
AGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGCGTACAAATATTCAAACAAAGAGTATGATGCTTTAATA 
AAAAAA.TCTAATTTTGAACTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCTGAAGAGATAATAGCAGAAA 
AAGACTTTCCTATGGCACCTTTATATATACCCAAATCTCATTATCTTTTCAGAAATGATAAATGGACAGGGTGGGT 
ACCAAATATCGCAGAAAGCTATTTATATGAAGATATTAAAACTAAAAAATAA 

t607 .nt 

TGTATTAGTAATGCTAAAAAAGAAAAAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCCTC 
AACTCTCAACAGACCTTTACGGTAGCAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAAC 
TGGAAAATATAAACCAGGACTTGCAAAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGA 
GAAGATAT AGTTTGG AGCGATGGAGTTGC CATTAC TGC CGAGGAGATAAAAAAATCAT ACC TAAGAATTTTAAAT A 
AAAAAACAGCTGCAATGTATGCTAATTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACAGT 
GCGTGAATCTGAGCTTGGCATAAAGGCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTAT 
TTTCCTGATATGCTAACACACTCAGCATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGA 
CAAATCCTGAAAATATAGTTGTTAGTGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGA 
AAAAAATGAAAAATACTATAATGCAAAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTG 
GCTTACAATATGTACATAAACGGTGAACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAA 
TAAGAGATGATTATTATTCTGGGTTAAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGA 
CAATTTAAAAGTTAGACAAGCCATCTCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGT 
TCAGATCCAACAAGAAATCTAACTCCAAAATTTGATGATTATTCTTATGGAAAAAATTTAATACTATTTGATCCTG 
AGAATGCAAAAAAACTTTTAGCTGAAGCTGGATATCCGGATGGGAAAGGATTCCCCACATTAAAATATAAAATATC 
GGAGGGAAGACCAACAACAGCAGAATTTTTGCAAGAACAATTTAAAAAAATACTAAACATTAACTTAGAAATCGAG 
AATGAAGAATGGACAACATTCCTAGGAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAG 
ATTATTTTGATCCCTTAACATTCTTAGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGCGTACAAATATTC 
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7GATGCTTTAATAAAAAAATCTAATTTTGAACTTGATCCAATAAAAAGACAAGACATTTTAAGA 
ATAATAGCAGAAAAAGACTTTCCTATGGCACCTTTATATATACCCAAATCTCATTATCTTTTCA 
GGAC AGGGTGGGT AC C AAAT ATC GC AGAAAGC T ATTT AT ATGAAG AT AT T AAAAC T AAAAAAT A 



MXKIJLFIF ISrYLFGJHDSSLKIGIDDVYVEAHEEGFHLFIRKKPAIKSVILTESFEIPDKKKDVATYSFRTLSY 
NTO^;G3EI?Z^;G?r/i:<NKELLSLTSSTPVPNKKFGEAFHILIPKKLKYGFPNFSTRSGDIDLEVLKSKKEPFWFS 
IRSFZKK-.^TZYLGRYQDNAYELLFKDDQNQGKIEFNELKDTFTKFSDEWIANNGIDIVDKINKILKNSEDSVYDL 
3_ /- — 33:-^.S^;ZZILKEHLFSIIEPQLQKFKSYRIGLVFYKDYLEDFLTKAFDFNTIPYLNNILKYVNVGGG 

GDYPZAVFZGZDAA'/TQFDWRAERRF I IVIGDAPPHEYPRGS IVYKDVINSAKEKD ITI YGI IFQ 



FE3SSZKrC-Z3D\Zr/F-AHSEGFHLFIRKKPAIKSVILTESFEI?DKKKDVATYSFRTLSYNKVNGDEIRILNGRVI 
K^rKEZ-LSirSSTPVPNKXFGEAFHILIPKKLKYGFPNFSTRSGDIDLEVLKSKKEPFWFSIRSFEKKYNDYLGRYQ 
Z^ - ~ ZZJKDDQNCGKISFNELKDTFTKFSDEWIANNGIDIVDKINKILKNSEDSVYDLDLVLVVDVTDSMKSNI 
EZZKZC-rZFSZZZPQZQKFKSYRZGLVFYKDYLEDFLTKAFDFNTZPYLraSTZLKYVWGGGGDYPEAVFEGZDAAVT 
QFZ r ,-r?_\Z?FZ"ZZ7ZGDA?PHEYPRGSIVYKDVINSAKEKDITIYGZIFQ 



a~GAiG-J^J^TTZTTTTATTTCTTTTTATTAGTTTTTATTTGTTTGGATTTGAAGATAGTTCTTTGAAAATAGGTA 
rZG-ZG-ZGTTTA'rGTTGAGGCTCATGAAGAGGGATTTCATCTTTTTATTAGAAAAAAACCTGCAATCAAATCAGT 
AATACTGACAGAGTCTTTTGAAATTCCTGATAAGAAAAAAGATGTGGCTACTTATTCATTTCGTACATTAAGTTAT 
AATA--GGCTAATGGA.GATGAAATTCGGATTTTAAATGGAAGAGTTATTAAGAATAAAGAACTTTTATCATTGACAT 
CTTCCAXCCCTGZCCCTAATAAAAAGTTTGGAGAAGCTTTTCATATATTGATTCCAAAAAAATTAAAATATGGATT 
TCCA.--AZCTC-TCAACAAGAAGTGGTGATATTGACTTAGAAGTATTAAAAAGTAAAAAAGAGCCCTTTTGGTTTTCT 
AT>_AGA.TCZCTTGAGAAAAAATATAATGATTATTTGGGCAGATATCAAGACAATGCTTATGAATTGCTTTTCAAGG 
A~GATCAJ^J\ATCAGGGAAAAATTGAATTTAATGAATTAAAAGATACTTTTACAAAATTTTCAGATGAGGTTGTTAT 
TGCTAATAAZGGCATTGATATTGTTGATAAAATAAACAAAATTTTAAAAAACTCAGAAGATTCAGTTTATGATTTA 
GA7TYAG7GC TTGZTGTTGATGTTACTGATAGTATGAAAAGCAATATTGAGATTCTAAAAGAGCATTTGTTTTCAA 
TAATAGAACCTCAACTTCAAAAGTTTAAATCCTACAGAATAGGTCTTGTTTTTTATAAAGACTATCTTGAAGATTT 
TT~A_ACCAAAGCZ"TTTGATTTTAATACTATTCCTTATTTAAATAATATTCTTAAGTATGTTAATGTTGGTGGCGGT 
GGGG-ATTATCCAGAAGCTGTTTTTGAGGGGATTGATGCTGCTGTGACCCAATTTGATTGGCGGGCAGAAAGAAGGT 
TT-ZTAZZGTTAZAGGAGATGCACCTCCTCATGAGTATCCAAGAGGGTCTATTGTTTATAAAGATGTTATCAATTC 
TGCAA_AC^AA_\AAGATATTA.CAATTTATGGAATAATATTTCAGTAA 



TTTGAAGATA.GTTCTTTGAAAATAGGTATTGATGATGTTTATGTTGAGGCTCATGAAGAGGGATTTCATCTTTTTA 
TTAGAJLAAAAACCTC<:AATCAAATCAGTAATATTGACAGAGTCTTTTGAAATTCCTGATAAGAAAAAAGATGTGGC 
TACTCATTCATT-ZCGTACATTAAGTTATAATAAGGTTAATGGAGATGAAATTCGGATTTTAAATGGAAGAGTTATT 
A^CA_-TA.-JVGAZA.CTTTTATCATTGACATCTTCCACCCCTGTTCCTAATAAAAAGTTTGGAGAAGCTTTTCATATAT 
TGATTCCAAAJ\AAATTAJ\AATATGGATTTCCAAATTTTTCAACAAGAAGTGGTGATATTGACTTAGAAGTATTAA^ 
AAGTAAAAAAGAGCCCTTTTGGTTTTCTATAAGATCTTTTGAGAAAAAATATAATGATTATTTGGGCAGATATCAA 
GACAATC<:TTATGAATTGCTTTTCAAGGATGATCAAAATCAGGGAAAAATTGAATTTAATGAATTAAAAGATACTT 
TTACAJLA^TTTTCAGATGAGGTTGTTATTGCTAATAATGGCATTGATATTGTTGATAAAATAAACAAAATTTTAAA 
AAACC'CAGAAGAZTCAGTTTATGATTTAGATTTAGTGCTTGTTGTTGATGTTACTGATAGTATGAAAAGCAATATT 
GAGAZTCTAAJA-GAGCATTTGTTTTCAATAATAGAACCTCAACTTCAAAAGTTTAAATCCTACAGAATAGGTCTTG 
T7-zrT^TAAAC-ACTATCTTGAAGATTTTTTAACCAAAGCTTTTGATTTTAATACTATTCCTTATTTAAATAATAT 
TCTT.AAGTAZGTZAATGTTGGTGGCGGTGGGGATTATCCAGAAGCTGTTTTTGAGGGGATTGATGCTGCTGTGACC 
CAATTTGATTGGCGGGCAGAAAGAAGGTTTATTATTGTTATAGGAGATGCACCTCCTCATGAGTATCCAAGAGGGT 
CTATTGZZTATAAAGATGTTATCAATTCTGCAAAGGAAAAAGATATTACAATTTATGGAATAATATTTCAGTAA 
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f 617 .aa 

MIFFHJSrSFMALIFSFSILSISYFFGDFFQFSYIKMISWRFILFLIMATGIATCAKSNSLNLGNEGQIYFGAFLVYI 
FSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKALTGLLISYGNQRLVDGFILNMLKTGSFSNQTKRI 
NSLFALDSSLIYLFLLGVSVWLFYVFIHKKTIYGLQLSILSNKKKIDIFFNINEFKYKFFAVFGSAFLNGLAGSMF 
WFFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFLNINYDFKYEFIGLCQSIAIFISLFL 
IKARKK 

t617 .aa 

AKSNSLNLGNEGQIYFGAFLVYIFSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKALTGLLISYGNQ 
RLVDGFILNMLKTGSFSNQTKRINSLFALDSSLIYLFLLGVSVWLFYVFIHKKTIYGLQLEILSNKKKIDIFFNIN 
EFKYKFFAVFGSAFLNGLAGSMFVVFFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFLNI 
NYDF K YEF I GLC Q S I AI F I S L FL I KARKK 

fS17.nt 

l^f. ATGATCTTTTTTAGAAATAGCTTTATGGCATTAATTTTTTCTTTTTCAATATTAAGTATTAGCTATTTTTTCGGTG 
ATTTTTTTCAATTTTCTTATATTAAAATGATATCTTGGCGCTTTATTTTATTTTTAATTATGGCTACGGGGATTGC 
M . TACTTGTGCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATA 
i%l TTTTCAAGTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTT 
•« TGGGGCTTATCCCCTTTTTTATTACTTTTTTCTTCGGATTAAATAAAGCCTTAACAGGTCTTTTAATATCTTATGG 
AAATCAAAGATTGGTGGATGGATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATT 
AATAGTTTGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTT 
Q TTATTCACAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAA 
*y TATAAATGAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCTTGCAGGTTCTATGTTT 
f|| GTAGTGTTTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAA 
\J TTTCAGGATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCT 
Q ^-''^•'^'-^■TAAATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTG 
\^ ATTAAAGCTAGGAAAAAGTAG 

tS17.nt 

GCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATATTTTCAA 
GTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTTTGGGGCT 
TATCCCCTTTTTTATTACTTTTTTCTTCGGATTAAATAAAGCCTTAACAGGTCTTTTAATATCTTATGGAAATCAA 
AGATTGGTGGATGC-ATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATTAATAGTT 
TGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTTTTATTCA 
CAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAATATAAAT 
GAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCTTGCAGGTTCTATGTTTGTAGTGT 
TTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAATTTCAGG 
ATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCTTAATATA 
AATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTGATTAAAG 
CTAGGAAAAAGTAG 

fS31.aa 

MWEINSLRTCYLLVLLLLVAYGLWFYTSSFFLSLELTGNPNFLFFTRLNYLFLSFMVFLVFERISLNFLKK3IF 
PVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKJMNGISYWIKPMLIFA.IFW 
VLIILQNDYSTAIYFAILFFIVLFVSNMAFSYVFAIWTFLPVSAIFLMLEPYRVSRIFAFLNPYDDPSGKGYQII 
ASLNALKSGGILGKGLGMGEVKLGKLPEANSDFIFSVLGEELGFLGVLFAISLFFLFFYFGYFIAIHSNSRFKFFI 
AFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 



tS31 . aa 
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RISLNFLKKSIFPVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISY 
WIKPMLIFAIFWVLIILQNDYSTAIYFAILFFIVLFVSNMAFSYVFAIWTFLPVSAIFLMLEPYRVSRIFAFLN? 
YDDPSGKGYQIIASLNALKSGGILGKGLGMGEVKLGKLPEANSDFIFSVLGEELGFLGVLFAISLFFLFFYFGYFI 
AIHSNSRFKFFIAFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTKALSGLISWSKNLSNW 

'fS31.nt 

ATGGTTGTAGAGATAAATTCACTTAGGACATGTTATTTGCTTGTTTTGCTGCTATTGGTAGCCTATGGCCTTGTAG 
TTTTTT ATAC TTCTTC CTTTTTTCTAAGC TTAGAATTGAC AGGTAATC C AAATTTTTT ATT TTTC ACAAGAC TT AA 
TTATCTTTTTTTAAGTTTTATGGTTTTTCTTGTTTTTGAAAGGATTTCTTTAAATTTTTTAAAAAAATCAATATTT 
CCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTTTTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGA 
TATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTTTAAAATATCTTTTACTATTTATCTTTCAGCTTATTT 
GAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATACTGGATAAAGCCAATGTTGATTTTTGCAATTTTTTGG 
GTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTTATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTT 
CTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTACTTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGA 
ACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCTTACGATGATCCTTCTGGCAAAGGTTACCAGATAATA 
GCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTAAAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAAT 
TACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGGAGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTAT 
AAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATAGCTATTCATTCTAATAGTAGGTTTAAATTTTTTATT 
GCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGATGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTA 
CAGGGATAAATTTACCATTTTTTTCATCTGGGGGATCTTCTATTATTGTTACCATGGCATTGTCTGGCCTTATTTC 
AAATGTTTCAAAAAATTTAAGTAATAATTGA 

tS31 -nt 

AGGATTTCTTTAAATTTTTTAAAAAAATCAATATTTCCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTT 
TTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGATATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTT 
TAAAATATCTTTTACTATTTATCTTTCAGCTTATTTGAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATAC 
TGGATAAAGCCAATGTTGATTTTTGCAATTTTTTGGGTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTT 
ATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTTCTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTAC 
TTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGAACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCT 
TACGATGATCCTTCTGGCAAAGGTTACCAGATAATAGCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTA 
AAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAATTACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGG 
AGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTATAAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATA 
GCTATTCATTCTAATAGTAGGTTTAAATTTTTTATTGCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGA 
TGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTACAGGGATAAATTTACCATTTTTTTCATCTGGGGGATCTTC 
TATTATTGTTACCATGGCATTGTCTGGCCTTATTTCAAATGTTTCAAAAAATTTAAGTAATAATTGA 

f647 .aa 

MKVNNFLSFFFRAFFLLFLIVILFFFVLFFIDFIGMYNTKRYFPEFVRTKLLGETSLVFDHNSNIILDEARLVKSR 
EAIDIKNQQIEKLKEDLKLKEDSLNKLEFELKQKQKDLDLKQKIIDDIINKY1SIDEEANILQTAVYLMNMPPEDAVK 
RLEDLNPELAISYMRKIEELSKKEGRLSIVPYWLSLMDSKKAAILIRKMSVSSLE 

t647 .aa 

IDFIGMYNTKRYFPEFVRTKLLGETSLVFDHNSNIILDEARLVKEREAIDIKNQQIEKLKEDLKLKEDSLNKLEFE 
LKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAWRLEDLNPELAISYMRKIEELSKKEGRLSIV 
PYWLSLMDSKKAAILIRXMSVSSLE 

f 647 .nt 

ATGAAAGTGAATAATTTTTTATCGTTCTTTTTTAGGGCATTTTTTTTGTTATTTTTAATTGTTATTTTATTTTTCT 
TTGTATTATTCTTTATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATTTGTAAGAACCAAGTT 
GTTAGGAGAAACTTCTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATGAAGCTAGACTTGTGAAGGAAAGA 
GAAGCTATTGATATTAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTTAAAAGAAGACAGTTTAAATA 
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AGC TTGAATTTGAGC TT AAGC AAAAGC AG AAAGATTT AGATTT AAAAC AAAAAAT AAT AG ATGAC ATTAT AAAT AA 
ATATAATGATGAGGAAGCAAATATTTTGCAAACAGCTGTATATTTAATGAATATGCCACCAGAAGATGCTGTTAAG 
CGGCTTGAAGATTTAAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAAAAGAAGGTC 
GTTTATCAATTGTTCCTTATTGGTTATCTCTTATGGATTCTAAAAAAGCTGCTATATTGATTAGAAAAATGTCTGT 
TAGTTCATTGGAGTAG 

t647 .nt 

ATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATTTGTAAGAACCAAGTTGTTAGGAGAAACTT 
CTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATGAAGCTAGACTTGTGAAGGAAAGAGAAGCTATTGATAT 
TAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTTAAAAGAAGACAGTTTAAATAAGCTTGAATTTGAG 
CTTAAGCAAAAGCAGAAAGATTTAGATTTAAAACAAAAAATAATAGATGACATTATAAATAAATATAATGATGAGG 
AAGC AAAT ATTTTGCAAACAGCTGT AT ATTTAATGAATATGCCACCAGAAGATGCTGTTAAGCGGCTTGAAGATTT 
AAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAAAAGAAGGTCGTTTATCAATTGTT 
CCTTATTGGTTATCTCTTATGGATTCTAAAAAAGCTGCTATATTGATTAGAAAAATGTCTGTTAGTTCATTGGAGT 
AG 

f653 .aa 

MLTYGDMVTLLLVFFVTMFSLNDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQ 

TAKNKSMIEFIEKIQSKNIVVRQEERGIVISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGH 

TDNIIOTJVNGFWKSNWELSAARSV^ 

TTDASLSFPKEIKQ 

t653 .aa 

NDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQTAKNKSMIEFIEKIQSKNIW 
RQEERGIVISLAADAFFDSASADVKLEEWRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWKSNWELSAA 
RSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDTPEGRAYNRRIDILITTDASLSFPKEIKQ 

f653 .nt 

ATGTTGACTTATGGAGACATGGTTACTTTGCTGCTTGTGTTTTTTGTTACAATGTTTTCATTAAATGATATTATTT 
TTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGCTGGATTTTTCAAGGGCGGTAAAACTTTAGA 
TTTTAGTAAATTATCTTATTTGAGTAATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAAACAAGCATCTCAG 
ACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAAGATTCAGTCTAAAAATATTGTAGTTAGGCAAGAAGAAA 
GAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGCTTGAAGAGAATAG 
AGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAAAATAGAAGGGCAT 
ACAGATAATATTGATACTGATGTAAATGGACCTTGGAAAAGCAATTGGGAACTTTCGGCTGCTAGATCTGTTAATA 
TGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGAAAATAATTTTGAAGTATCTGGTTT 
TGGTGGAAGTAGGCCTATTGCAACAGACGATACCCCTGAGGGTAGGGCTTATAATAGAAGAATTGATATATTAATT 
ACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

t653 .nt 

AATGATATTATTTTTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGCTGGATTTTTCAAGGGCG 
GTAAAACTTTAGATTTTAGTAAATTATCTTATTTGAGTAATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAA 
ACAAGCATCTCAGACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAAGATTCAGTCTAAAAATATTGTAGTT 
AGGCAAGAAGAAAGAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGC 
TTGAAGAGAATAGAGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAA 
AATAGAAGGGCATACAGATAATATTGATACTGATGTAAATGGACCTTGGAAAAGCAATTGGGAACTTTCGGCTGCT 
AGATCTGTTAATATGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGAAAATAA.TTTTG 
AAGTATCTGGTTTTGGTGGAAGTAGGCCTATTGCAACAGACGATACCCCTGAGGGTAGGGCTTATAATAGAAGAAT 
TGATATATTAATTACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

f664.aa 



TABLE 1 . Nucleotide and Amino Acid Sequences 



MRMSVYTMGFAYIRSIMGYVVLFFFASLAVNFFVNIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNL 
FKSLLKWIICLIYYFIIENNIGKISKLSEYTLQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKM 
TKEEVKQERKEMEGDPLLRSRIKERMRVILSTNLRVAIPQADWITNPEHFAVAXKWDSETMLAPKVLAKGQDEIA 
LT IKKI AREMNVPLMENKLL ARAL YANVKVTMEE I PREYWE I VSKILVRVYS I TKKFN 

t6 6 4.aa 

FVWIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNLFKSLLXWIICLIYYFIIENNIGKISKLSEYT 
LQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLICMTKEEVKQERKEMEGDPLLRSRIKERMRVILST 
NLRVAIPQADWITNPEHFAVAIKWDSETMLAPKVLAKGQDEIALTIKKIARENNVPLMENKLLARALYANVKVNE 
E I PREYWE TVS K I L VRVYS I TKKFN 

f 664..nc 

ATGCGTATGAGTGTTTATACTATGGGTTTTGCATATATTAGATCTATCATGGGGTATGTCGTTTTGTTTTTTTTCG 
CATCTTTAGCTGTTAATTTTTTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAG 
GTGGGATAAAATTAGTTTTAATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAGGGGCTTTTTTCAATTTG 
TTTAAAAGTTTGTTAAAAGTTGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTT 
CTAAGCTTTCGGAGTATACACTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTC 
AGTAATGTTTTTGGCAATTGTAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTGAAAATG 
ACAAAAGAAGAGGTAAAGCAGGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAATAAAAGAGAGAA 
TGAGGGTTATTTTAAGTACCAATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTT 
TGCAGTTGCTATTAAATGGGATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCT 
CTCACAATTAAAAAAATTGCAAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATG 
CTAATGTTAAGGTTAATGAAGAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTC 
TATTACTAAAAAGTTTAATTAG 

t6€4 .nt 

TTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAGGTGGGATAAAATTAGTTTTA 
ATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAGGGGCTTTTTTCAATTTGTTTAAAAGTTTGTTAAAAGT 
TGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTTCTAAGCTTTCGGAGTATACA 
CTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTCAGTAATGTTTTTGGCAATTG 
TAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTGAAAATGACAAAAGAAGAGGTAAAGCA 
GGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAATAAAAGAGAGAATGAGGGTTATTTTAAGTACC 
AATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTTTGCAGTTGCTATTAAATGGG 
ATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCTCTCACAATTAAAAAAATTGC 
AAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATGCTAATGTTAAGGTTAATGAA 
GAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTCTATT^CTAAAAAGTTTAAT'T 
AG 

f680 .aa 

MFTLSFVLINFIITGILILMLEFNFLKVDFKGNILLAGIFMGLMQGLGALPGISRSGITIFSASVIGFNRKSAFEI 
SFLSLIPIWGAILLKHKEFYDIFMVLNFFEINLGALVAFWGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYF 
VRI 

t680 .aa 

ITGILILMLEFNFLKVDFKGNILLAGIFMGLMQGLGALPGISRSGITIFSASVIGFNRKSAFEISFLSLIPIVFGA 
ILLKHKEFYDIFtlVLNFFEINLGALVAFWGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYFVRI 



f680 .nt 
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AtGTTTACATTGTCTTTCGTTTTAATTAATTTTATTATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTT 
TAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAGGAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCC 
AGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGCATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATT 
TCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCGATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGG 
TTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAGTTGCTTTTGTTGTTGGTATTTTCTCAATAAATTTCTTTTT 
TAJ\AATGCTTAATAACAAAAAACTGTATTATTTTTCTATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTT 
GTTAGAATATGA 

t680.nt 

ATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTTTAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAG 
GAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCCAGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGC 
ATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATTTCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCG 
ATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGGTTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAG 
TTGGTTTTGTTGTTGGTATTTTCTCAATAAATTTCTTTTTTAAAATGCTTAATAACAAAAAACTGTATTATTTTTC 

□ TATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTTGTTAGAATATGA 

^ f688.aa 

Q MIVLLISIGCANAVHIINEIFKLIKKEQLSKESIKATIKKLKTPILLTSFTTAFGFLSLTTSSINAYKTMGIFMSI 
;|| GVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIMVLIILGISFV 
UJ - GLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLDAFSAKTQSSS 

□ INGILKFTNPKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYINDDWSLISIIVRIEDNSTEGIKKFEK 
YAINTIWEYMKNWKYHFSGVYDKVLIAKTMVKEQVINIITTLGSITLLLMFFFKSIKTGIIIAIPVAWSVFLNFAV 

q MRLFGITLNPATATIASVSMGVGVDYSIHFFNTFILQYQKNQIYKTALLESIPNVFNGIFANSISVGIGFLTLTFS 
SYKIISTLGAIIAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQZ 

l}\ t688.aa 

YKTMGIFMSIGVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIM 
■ 5i? VLULGISFVGLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLD 
AFSAKTQSSSINGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYINDDWSLISIIVRIEDN 
■STEGIKKFEKYAINTINEYMKNNKYHFSGVYDKVLIAKTMVKEQVINIITTLGSITLLLMFFFKSIKTGIIIAIPV 
AWSVFUSIFAVMRLFGITLNPATATIASVSMGVGVDYSIHFFNTFILQYQICNQIYKTALLESIPNVFNGIFANSISV 
GIGFLTLTFSSYKIISTLGAIIAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQZ 

f68 8 .nt 

ATGATTGTTTTACTTATTTCAATCGGATGCGCCAATGCTGTACATATAATAAATGAAATATTTAAATTAATAAAAA 
AAGAACAGCTCTCAAAAGAATCCATAAAAGCAACAATTAAAAAACTTAAAACACCCATCCTGCTAACATCTTTTAC 
AACTGCATTTGGATTTTTATCTCTTACAACCTCTTCAATTAATGCCTACAAAACAATGGGTATTTTCATGTCAATT 
GGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAATAATAACATTAATCCCATTTGCAAAAAAAA 
AGTCTTTTGAAAAAGAAAAAGAAAATAAACTAAATAAAATATCCTTCCTTGAAAGACTTGCCAAACTAAATACGCA 
AATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATGGTCCTCATCATACTGGGAATTTCTTTTGTA 
GGTCTTTTAAAAATCGAAATCAATTTTGATGAAAAAGATTACTTTAAAGAAAGCACAAGTGTAAAAAAAACATTAA 
ACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGAAATTGAAGGCAGGCCCGGTGAATTTAAAAA 
TGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGATGCATTTTCTGCAAAAACTCAATCTAGTTCT 
ATTAATGGCATTTTAAAATTTACAAATTTTAAAATTAAAAAAGAATCCCCACTAGAGTATAAACTGCCTGAAAATA 
AAATTATAGTAAACAAACTAATAAATTTGATAGATAAAAGCGATTGGACTAAGGACAATAAAAGAATC-TACATTAA 
CGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAACTCAACCGAAGGAATAAAAAAATTTGAAAAA 
TATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATCATTTCTCAGGTGTTTATGATAAGGTATTAA 
TAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAACTCTTGGATCAATAACACTACTACTTATGTT 
TTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTAGCATGGTCAGTGTTTTTAAACTTTGCTGTA 
ATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTGCATCTGTAAGCATGGGAGTAGGAGTAGATT 
ATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAATCAAATCTACAAAACTGCACTTCTTGAATC 
AATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTGTTGGAATAGGATTTTTAACTCTAACATTTTCG 
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TCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAATGCTAACGACATCTCTTGCATCACTAACTC 
TTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGCCTCAAACAACAATTTTAAAAAATTAAAACA 
ATAA 

t688 .nt 

TACAAAACAATGGGTATTTTCATGTCAATTGGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAA 
TAATAACATTAATCCCATTTGCAAAAAAAAAGTCTTTTGAAAAA.GAAAAAGAAAATAAACTAAATAAAATATCCTT 
CCTTGAAAGACTTGCCAAACTAAATACGCAAATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATG 
GTCCTCATCATACTGGGAATTTCTTTTGTAGGTCTTTTAAAAATCGAAATCAATTTTGATGAAAAAGATTACTTTA 
AAGAAAGCACAAGTGTAAAAAAAACATTAAACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGA 
AATTGAAGGCAGGCCCGGTGAATTTAAAAATGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGAT 
GCATTTTCTGCAAAAACTCAATCTAGTTCTATTAATGGCATTTTAAAATTTACAAATTTTAAAATTAAAAAAGAAT 
CCCCACTAGAGTATAAACTGCCTGAAAATAAAATTATACTAAACAAACTAATAAATTTGATAGATAAAAGCGATTG 
GACTAAGGACAATAAAAGAATGTACATTAACGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAAC 
TCAACCGAAGGAATAAAAAAATTTGAAAAATATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATC 
ATTTCTCAGGTGTTTATGATAAGGTATTAATAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAAC 
TCTTGGATCAATAACACTACTACTTATGTTTTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTA 
GCATGGTCAGTGTTTTTAAACTTTGCTGTAATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTG 
CATCTGTAAGCATGGGAGTAGGAGTAGATTATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAA 
TCAAATCTACAAAACTGCACTTCTTGAATCAATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTGTT 
GGAATAGGATTTTTAACTCTAACATTTTCGTCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAA 
TGCTAACGACATCTCTTGCATCACTAACTCTTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGC 
CTCAAACAACAATTTTAAAAAATTAAAACAATAA 

f704.aa 

MNYTKFQEFISEFLGTFILLALGTGSVAMTVLFSSSPEIPGEIIKGGYTNIVFGWGLGVTFGIYTAARMSGAHLNP 
AVSIGLASVGKFPVSKLLHYIVAQILGAFTGALMTLWFYPKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFG 
TFLLMFLISWGDFTKKHSDNPFIPFIVGAWLSIGISFGGMNGYAINPARDLGPRILLLFAGFKNHGFNNLSIVI 
VPIIGPIIGAILGATIYEFTLKNNKD 

t704.aa 

GEIIKGGYTNIVFGWGLGVTFGIYTAARMSGAHLNPAVSIGLASVGKFPVSKLLHYIVAQILGAFTGALMTLWFY 
PKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFGTFLLMFLISWGDFTKKHSDNPFIPFIVGAWLSIGISFG 
GMNGYAINPARDLGPRILLLFAGFKlsrHGFNNLSIVIVPIIGPIIGAILGATIYEFTLKNNKD 

f704.nt ■ 

ATGAATTATACAAAATTCCAAGAATTTATATCGGAATTTTTGGGAACATTTATCCTATTGGCTCTAGGAACTGGAT 
CTGTTGCAATGACAGTATTATTTTCCTCAAGTCCCGAAATACCAGGAGAAATAATAAAAGGAGGATATACAAATAT 
AGTATTTGGATGGGGATTGGGTGTAACGTTTGGTATTTACACAGCAGCAAGAATGAGCGGAGCACACCTAAACCCA 
GCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTCAAAACTTTTACATTACATTGTAGCACAAATAT 
TAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTATCCTAAATGGATAGAAATGGATCCTGGCTTAGA 
AAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGATTTTTGCCTGGATTTATTGATCAAATTTTTGGA 
ACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGGAGATTTTACAAAAAAACACAGCGACAATCCATTTATTCCTT 
TTATTGTAGGAGGAGTGGTTTTATCAATAGGGATAAGTTTCGGAGGAATGAACGGTTATGCTATTAATCCTGCAAG 
GGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATTTAAAAATCACGGATTTAACAATCTAAGTATAGTTATT 
GTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTACAATTTACGAATTTACACTAAAAAATAACAAAG 
ACTAA 

t704 .nt 

GGAGAAATAATAAAAGGAGGATATACAAATATAGTATTTGGATGGGGATTGGGTGTAACGTTTGGTATTTACACAG 
CAGCAAGAATGAGCGGAGCACACCTAAACCCAGCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTC 
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AAAACTTTTACATTACATTGTAGCACAAATATTAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTAT 

CCTAAATGGATAGAAATGGATCCTGGCTTAGAAAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGAT 

TTTTGCCTGGATTTATTGATCAAATTTTTGGAACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGGAGATTTTAC 

AAAAAAACACAGCGACAATCCATTTATTCCTTTTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCGGA 

GGAATGAACGGTTATGCTATTAATCCTGCAAGGGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATTTAAAA 

ATCACGGATTTAACAATCTAAGTATAGTTATTGTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTAC 

AATTTACGAATTTACACTAAAAAATAACAAAG 

ACTAA 

f707 .aa 

MRRLFLLYILCSFVFLNLFAQGSSSYIDKQKELAIFYYEVGQRYINVGKIKKGKLFQAKALKIYPDLKKGFDIKLA 
VKELDARIKDDNPKWMLEDIKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDKIIKFQFGKFARALISRN 
FDLFDSVIADKVNVMGQFESKNDFISTLSSASSKADADELEYLSVDDYYDLKSLKISKSNDTSFAVN^ 
KNF PFWKERQTL I FTTEDDNNWFLS S INZ 

c707 .aa 

MRRLFLLYILCSFVFLNLFAQGSSSYIDKQKELAIFYYEVGQRYINVGKIKKGKLFQAKALKIYPDLKKGFDIKLA 
VKELDARIKDDNPKVVMLEDIKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDKIIKFQFGKFARALISRN 
FDLFDS VI ADKVWVMGQFES KNDF I S TLS SAS SKADADELEYLS VDDYYDLKSLK I SKSNDTSFAVNVNAKKNDVT 
KNFPFWKERQTLIFTTEDDNNWFLSSINZ 

f707 .nt 

ATGAGAAGATTATTTCTTCTATATATTTTATGTTCTTTTGTTTTTTTGAATTTATTTGCTCAAGGTAGTTCTTCTT 
ATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAAACGTTGGTAAAATTAA 
AAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAAGATTTATCCAGATTTGAAAAAGGGGTTTGATATCAAGCTTGCA 
GTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATGCTTGAGGATATTAAGCTTGAGGAGA 
TACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCCTAAAATAGAATATATTGCTCAAAG 
AGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGCTTTAATTTCTAGGAAC 
TTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCAAAAAATGATTTTATAT 
CAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTGATGATTATTACGATTT 
AAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTGCTGTTAATGTTAATGCCAAAAAAAATGATGTTACT 
AAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAATAATTGGTTTTTGTCTT 
CCATAAATTGA 

t707 .nt 

CAAGGTAGTTCTTCTTATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAA 
ACGTTGGTAAAATTAAAAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAAGATTTATCCAGATTTGAAAAAGGGGTT 
TGATATCAAGCTTGCAGTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATGCTTGAGGAT 
ATTAAGCTTGAGGAGATACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCCTAAAATAG 
AATATATTGCTCAAAGAGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGC 
TTTAATTTCTAGGAACTTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCA 
AAAAATGATTTTATATCAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTG 
ATGATTATTACGATTTAAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTGCTGTTAATGTTAATGCCAA 
AAAAAATGATGTTACTAAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAAT 
AATTGGTTTTTGTCTTCCATAAATTGA 

f 709 .aa 

MLIFGFIGLFFLNIFSLHAQGIVTTNKDAQEEFKWALNSYNNGIYDDALLSFKKILSFDPNNLDYKFWTGNVYYRLG 
YVEEALMEWRNLKDQGYKVPYLRHLISTIEQRRGIFSNYELNFXKLVKVASLDNSIYKRPHGYQIT3LRADKYGGY 
YAANFVGNEI LYFDVNNNVNALVKDGF SYLKS PYDVIEANNLLYVTL YSSDEIGVYDKVLGVKRKSI GNKGTKDGE 
LLAPQYMAIDKRNYIYVSSWGNKRVSKFGLEGDFILHFGSRTSGYKGLLGPTGVTYLNENIYVADSLRNTIEVFDT 
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SGNHLYSVFTSIEGIEGLSSDFVGNWIVSSKDGVYKYSIAKKTITKILKADKMNSKISSSILDAKMQMIVSDFNN 
AKV 7 SWKSDASLYDSLNVDVRRIIRLGGPKIWELWSSKSGL?WGLKSENFSISNEISrYYIWPKVAYNVNASKD 
INIAWFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFINATSVPIIDNIESLTNSIRJMTSSLGPYSTDAVKTDVS 
LKLAGSGLMSKSSRRAVVYF SGG I LNRKAFEKYS LDT IVS YYKNND IRF YL I LFGNDP INS KLQYL VNETGGAVI P 
F S SYEGVS KVYDL I LEQKTGT YLLEYYYPGPQ EPNKYFNLS VEANINQQTGRGEF AYF IN 

t709 .aa 

QGIVTNKDAQEEFKWALNSYl^GIYDDALLSFKKILSFDPNNLDYHFWTGNVYYRLGY^/EEALMEWKNLKDQGYKV 
PYLRHLISTIEQRRGIFSNYELNFKKLVKVAS LDNSIYKRPHGYQITSLRADKYGGYYAANFVGNEILYFDVNNNV 
NALVKDGFSYLKSPYDVIEAOTNfLLYVTLYSSDEIGVYDKVL^ 

WGNKRVSKFGLEGDFILHFGSRTSGYKGLLGPTGVTYLNENIYVADSLRNTIEVFDTSGNHLYSVFTSIEGIEGLS 
SDFVGMWIVSSKDGWKYSIAiCKTITKILKADKMNSKISSSILDAJSnsrQMIVSDFNNAKVSVY 

VRRIIRLGGPKIYVELNVSSKSGLPWGLKSENFSISNENYYIWPKVAYNVNASKDINIAVVFDKSSYMKKYDTD 
QIVGLNALMELSKNKNFSFINATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKLAGSGLMSKSSRRAVVY 
FSGGILNRKAFEKYSLDTIVSYYKNNDIRFYLILFGNDPINSKLQYLVNSTGGAVIPFSSYEGVSKVYDLILEQKT 
GTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYFIN 

-f70 9 .nt 

ATGTTAATTTTTGGTTTTATTGGTTTGTTTTTTTTAAATATTTTTAGTTTGCATGCCCAAGGAATAGTTACTAATA 
AAGATGCTCAAGAAGAGTTTAAATGGGCTCTTAATTCTTATAATAATGGAATTTACGATGATGCTCTTTTATCTTT 
TAAAAAAATTTTAAGCTTTGATCCTAATAATCTTGATTATCATTTTTGGACTGGCAATGTTTATTATAGACTGGGT 
TATGTTGAAGAAGCTTTAATGGAATGGAGAAATTTAAAAGATCAAGGCTATAAGGTTCCCTATCTTAGACATTTGA 
TTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAATTATGAACTTAATTTTAAAAAACTTGTAAAAGTTGCTTC 
TCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCAGATTACATCTTTAAGGGCTGATAAGTACGGCGGATAT 
TACGCTGCTAACTTTGTAGGCAATGAAATATTGTATTTTGATGTTAATAACAATGTTAATGCTTTAGTTAAAGATG 
GCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTGCTTTATGTGACTCTTTATTCAAGTGA 
TGAAATTGGTGTTTATGACAAAGTTCTTGGAGTTAAAAGGAAATCTATTGGGAATAAAGGCACAAAAGATGGCGAA 
TTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAACTATATTTATGTAAGTGAGTGGGGAAATAAAAGAGTAA 
GTAAATTTGGACTTGAAGGTGATTTTATTTTGCATTTTGGTTCTAGAACTTCAGGCTATAAGGGCCTTTTAGGTCC 

cacaggcgttacttatttgaatgaaaacatttatgttgcagattctctgagaaataccattgaagtttttgatact 
agtggtaatcatttatattcagtttttacttctattgagggaatagaggggcttagcagtgattttgtaggtaata 
atgttatagtatcctcaaaagatggtgtttataaatatagcattgctaaaaagacaattacaaaaattttaaaagc 
agataaaatgaattctaaaatttcttcatctattttggatgccaataatcagatgattgtctcagattttaataat 
gccaaggtttcagtttacaagagtgatgcaagcctttatgatagtttaaatgttgatgttagaagaataattaggc 
ttggagggcctaaaatttacgttgagcttaatgttagcagtaaaagcggattaccagttgttgggcttaaaagtga 
aaatttttcaatttcaaatgaaaattattacattgtcaatcccaaggtggcatataatgtaaatgcttcaaaaga.c 
attaatatagcagttgtttttgataaatcttcttatatgaaaaaatatgatacagatcaaattgtagggttaaatg 
ccctaatGgagttgtcaaaaaataaaaactttagttttataaatgcaacaagtgtgcccattatagataatattga 
aagcttaacaaatagcattagaaatacaagttctcttggtccttatagtacagatgctgtaaaaacagacgttagt 
ttgaagttggcaggttctc-ggcttatgtcaaaaagctcaagaagagcagtagtttattttagtggtggtattttaa 
atcgtaaagcttttgaaaagtactctttagatacaatagtaagctattataaaaataatgatataaggttttactt 
aatactatttggtaatgatcctattaatagtaagcttcagtatttagttaatgaaacaggcggtgctgtaattcct 
ttttcatcttatgaaggtgtatctaaagtttatgatttaattttagaacaaaaaacgggcacttatttgttggaat 
attattatccaggccctcaagaacctaataaatattttaatttatctgttgaagcaaatataaatcaacagacagg 
aagaggggagtttgcatattttattaattag 

t709.nt 

CAAGGAATAGTTACTAATAAAGATGCTCAAGAAGAGTTTAAATGGGCTCTTAATTCTTATAATAATGGAATTTACG 
ATGATGCTCTTTTATCTTTTAAAAAAATTTTAAGCTTTGATCCTAATAATCTTGATTATCATTTTTGGACTGGCAA 
TGTTTATTATAGACTGGGTTATGTTGAAGAAGCTTTAATGGAATGGAGAAATTTAAAAGATCAAGGCTATAAGGTT 
CCCTATCTTAGACATTTGATTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAATTATGAACTTAATTTTAAAA 
AACTTGTAAAAGTTGCTTCTCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCAGATTACATCTTTAAGGGC 
TGATAAGTACGGCGGATA.TTACGCTGCTAACTTTGTAGGCAATGAAATA.TTGTATTTTGATGTTAATAACAATGTT 
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AATGCTTTAGTTAAAGATGGCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTGCTTTATG 
TGACTCTTTATTCAAGTGATGAAATTGGTGTTTATGACAAAGTTCTTGGAGTTAAAAGGAAATCTATTGGGAATAA 
AGGCACAAAAGATGGCGAATTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAACTATATTTATGTAAGTGAG 
TGGGGAAATAAAAGAGTAAGTAAATTTGGACTTGAAGGTGATTTTATTTTGCATTTTGGTTCTAGAACTTCAGGCT 
ATAAGGGCCTTTTAGGTCCCACAGGCGTTACTTATTTGAATGAAAACATTTATGTTGCAGATTCTCTGAGAAATAC 
CATTGAAGTTTTTGATACTAGTGGTAATCATTTATATTCAGTTTTTACTTCTATTGAGGGAATAGAGGGGCTTAGC 

■ AGTGATTTTGTAGGTAATAATGTTATAGTATCCTCAAAAGATGGTGTTTATAAATATAGCATTGCTAAAAAGACAA 
TTACAAAAATTTTAAAAGCAGATAAAATGAATTCTAAAATTTCTTCATCTATTTTGGATGCCAATAATCAGATGAT 
TGTCTCAGATTTTAATAATGCCAAGGTTTCAGTTTACAAGAGTGATGCAAGCCTTTATGATAGTTTAAATGTTGAT 
GTTAGAAGAATAATTAGGCTTGGAGGGCCTAAAATTTACGTTGAGCTTAATGTTAGCAGTAAAAGCGGATTACCAG 
TTGTTGGGCTTAAAAGTGAAAATTTTTCAATTTCAAATGAAAATTATTACATTGTCAATCCCAAGGTGGCATATAA 
TGTAAATGCTTCAAAAGACATTAATATAGCAGTTGTTTTTGATAAATCTTCTTATATGAAAAAATATGATACAGAT 
CAAATTGTAC-GGTTAAATGCCCTAATGGAGTTGTCAAAAAATAAAAACTTTAGTTTTATAAATGCAACAAGTGTGC 
CCATTATAGATAATATTGAAAGCTTAACAAATAGCATTAGAAATACAAGTTCTCTTGGTCCTTATAGTACAGATGC 
TGTAAAAACAGACGTTAGTTTGAAGTTGGCAGGTTCTGGGCTTATGTCAAAAAGCTCAAGAAGAGCAGTAGTTTAT 
TTTAGTGGTGGTATTTTAAATCGTAAAGCTTTTGAAAAGTACTCTTTAGATACAATAGTAAGCTATTATAAAAATA 

.ATGATATAAGGTTTTACTTAATACTATTTGGTAATGATCCTATTAATAGTAAGCTTCAGTATTTAGTTAATGAAAC 
AGGCGGTGCTGTAATTCCTTTTTCATCTTATGAAGGTGTATCTAAAGTTTATGATTTAATTTTAGAACAAAAAACG 
GGCACTTATTTGTTGGAATATTATTATCCAGGCCCTCAAGAACCTAATAAATATTTTAATTTATCTGTTGAAGCAA 
ATATAAATCAACAGACAGGAAGAGGGGAGTTTGCATATTTTATTAATTAG 

f730 .aa 

MIKSILDYLLTLHPVLLGLLGSTFTWFTTAFGAAAVFFFRKVDNKIMDAMLGF SAGIMIAASFFSLIQPAIERA.EE 
LGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLTFIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNP 
DxQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGNVALAKCFNYGQMSGLVEIVGGLMGAYAVYSFTRILPFALAFS 
AGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGFTLMMFLDVSLGZ 

t73 0 .aa 

AVFFFRKVDNKIMDAMLGFSAGIMIAASFFSLIQPAIERAEELGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLT 
FIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNPDIQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGN 
VALAKCFNYGQMSGLVEIVGGLMGAYAVYSFTRILPFALAFSAGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGF 
TLMMFLDVSLGZ 

£7 3 0.nt 

ATGATAAAATCAATTTTAGATTATTTATTAACTTTGCATCCTGTATTATTGGGACTTTTAGGTTCTACTTTCACTT 
GGTTTACTACAGCTTTTGGAGCAGCAGCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCT 
TGGTTTTTCAGCTGGCATTATGATAGCGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAG 
CTTGGATACATTACTTGGGTGCCGGCTGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTGTAGATGTAT 
TTGTTCCAGATCTGGATAAACTTACTTTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTT 
TACTGCTGTTACTTTACATAATTTTCCAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTC-GCGTCTAATCCA 
GATATTCAAACTTTAGTTGGGGCTATGCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTA 
TTTCTCTGCCTTTAAGAAGAGGTAATGTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGA 
AATTGTGGGGGGGCTTATGGGTGCTTATGCGGTTTATTCTTTTACTCGAATTTTAGCTTTTGCTTTGGCTTTTTCT 
GCAGGAGCTATGATTTATGTGTCAATTGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGC 
CAAGTATATTTGGTGTTATTGGTTTTACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 

t730 .nt 

GCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCTTGGTTTTTCAGCTGGCATTATGATAG 
CGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAGCTTGGATACATTACTTGGGTGCCGGC 
TGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTGTAGATGTATTTGTTCCAGATCTGGATAAACTTACT 
TTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTTTACTGCTGTTACTTTACATAATTTTC 
CAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTGGCGTCTAATCCAGATATTCAAACTTTAGTTGGGGCTAT 
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GCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTATTTCTCTGCCTTTAAGAAGAGGTAAT 
GTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGAAATTGTGGGGGGGCTTATGGGTGCTT 
ATGCGGTTTATTCTTTTACTCGAATTTTACCTTTTGCTTTGGCTTTTTCTGCAGGAGCTATGATTTATGTGTCAAT 
TGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGCCAAGTATATTTGGTGTTATTGGTTTT 
ACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 



fl97 .aa 

MLLKLXYRFVGFLLLFLIFILLLFSTIFNFVLCGYLEDYYKQLTRAQVRRAAFSLQSFLDTLHVIINGAASNLALE 
TISEFAMSENRGKDFSESELIDLRKNPKFVIDSVKVSKKYRQYLYNFMANLKNDTLFEEFAFFDFEGRVIVSTRHE 
NNMDFGHSEANTNYFKKAVEDYRQNQLKFIGWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVCGYLAG 
YLLNDIVADSFDRFRFGFYKRGNFIYVDPNjSIIAWPFEEYNETSRVSSKFLNVLKDVFSKPPFPSNIASEVSVYTI 
DRILLSEMGEDCYYAMLPISSKLGEKSGVLIARLPYKDIYGVISSLRFQYILYSVLGIIALSIVLSIRIDRIISFR 
' LNAIRVLVQDMVKGNLDKDYALDDDENTLDELGMLSLQWKMKKAI S VAI S SVLRNI S YWKASLEVASSSQNLSS 
SALQQASALEEMSAWEQIASGVIMSATOISYETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEIARKTN 
LLALNAAIEAARAGDEGKGFAWASEIRKLADLSKISALEIGELVEDNSKVATEAGVIFKEMLPEIEETANLVKKI 
SEGSSKQSDQIAQFKMALDQVGEWQSSASSSEQLSSMSDKMLEKSKELRKSVLFFKIKDSKIENPENDDYDFRLI 
DCPENSFKDENQNLKSNGISTSNASGHNNYSLDIESESSVRTINKRVDPKKAID1ADKDLNFDDDFSEF 

tl97.aa 

VLCGYLEDYYKQLTRAQVRRAAFSLQSFLDTLHVIINGAASNLA-LETISEFAMSENRGKDFSESELIDLRKjMPKFV 

idsvkvskkyrqylynfmanlkndtlfeefaffdfegrvivstrhennmdfghseantnyfkkavedyrqnqlkfi 

GWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVCGYLAGYLLNDIVADSFDRFRFGFYKRGNFIYVDPN 

niavnpfeeynetsrvsskflnvlkdvfskppfpsniasevsvytidrillsemgedcyyamlpissklgeksgvl 
iarl p ykdi ygvi s slrfqyi lys vlg i ialsivls iridri i s frlnairvl vqdmvkgnldkdyaldddentld 
elgmlslqwkmkkaisvaissvlrwisywkaslevasssqnlsssalqqasaleemsaweqiasgv1smsakms 
^eteqialktnensqiggraveesviamqdivekvsvieeiarktnllalnaaieaaragdegkgfawaseirkl 
adlskisaleigelvednskvateagvifkemlpeieetanlvkkisegsskqsdqiaqfkmaldqvgewqssas 
sseqlssmsdkmlekskelrksvlffkikdskienpenddydfrlidcpensfkdenqnlksngistsnasghnny 
sldiesessvrtinkrvdpkkaidiadkdlnfdddfsef 

fl97.nt 

atgttattgaagcttaaatacaggtttgttggatttttattattgtttttaatttttatactgctacttttttcca 
cgatttttaattttgttttatgcggttatttagaagattattataagcagcttacaagggcgcaagtaagaagagc 
agctttttctttgcaatcttttttagacaccctgcatgtcataatcaatggtgcagcttctaatttggcacttgaa 

ACCATATCAGAATTTGCAATGTCTGA-GAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAA 
ATCCAAAATTTGTTATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCT 
TAAAAATGATACCCTTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATGAG 
AATAATATGGATTTTGGT cattc tgaggc TAATAC CAATT ATTTTAAAAAAGCTGTTGAGG ATT at aggcaaaacc 

aattaaaatttataggttggtattcaaatctttctgaaggaatatccgcagaagttgctattaggtctaaacaaag 
cgaaaaaaaggcttttgcaataattgtacctgtatattccccagaagataaacttgtttgtgggtatttggccgga 
tatttgcttaatgatattgtggcagatagttttgatagatttagattcggtttttataaaagaggcaattttattt 
atgtggatcccaacaatatagcagttaatccttttgaagaatataatgaaaccagcagggttagttctaaattttt 
gaatgttcttaaagatgttttctctaagcccccttttccatcaaacattgccagtgaagtgtcggtttacactatt 
gatagaatacttttgtccgaaatgggagaagattgttattatgcaatgttgcccataagtagtaaattgggagaaa 
agagtggagtacttattgctaggcttccttataaggatatttacggagtaatatctagtctaagatttcagtatat 
tttatattcagtcttaggcattatagcattaagtattgttctttcaattagaatagacaggattattagttttcgt 
ttaaacgcaattagagttctagttcaagatatggttaagggcaatttagataaagattatgctcttgatgatgatg 
-aaaatactcttgatgaacttggcatgttaagtcttcaggttgttaaaatgaaaaaagctatttctgtagcaatttc 
tagtgttttgagaaatattagctatgtaaataaggcaagtttagaagttgccagttcaagtcaaaatttaagctct 
agtgcattgcaacaggcatctgctcttgaagaaatgtcagctaatgttgagcaaatagcctcaggtgtcaacatga 
gcgccaataattcttatgaaacagaacaaatagctttaaagacgaatgaaaattctcagataggtggtagggccgt 
tgaagaatctgttattgctatgcaagacattgtggagaaagttagtgttattgaagagatagctagaaaaaccaat 
ttacttgctttgaatgcggctattgaagctgcaagagcaggagatgagggaaagggatttgctgttgtggccagtg 
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AGATTAGAAAGTTGGCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGT 
AGCAACTGAAGCGGGAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGATT 
TCAGAAGGTAGCTCTAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTC 
AATCTTCAGCTTCAAGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAACTTAGAAA 
ATCTGTATTATTTTTCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATA 
GATTGTCCTGAAAATTCTTTTAAAGATGAAAATCAAAATTTGAAAAGCAATGGAATTTCTACTTCAAATGCCAGTG 
GGCATAATAATTATTCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAA 
AGCTATCGATATTGCTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

tl97.nt 

GTTTTATGCGGTTATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGTAAGAAGAGCAGCTTTTTCTTTGC 
AATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTCTAATTTGGCACTTGAAACCATATCAGAATT 
TGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAAATCCAAAATTTGTT 
ATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCTTAAAAATGATACCC 
TTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATGAGAATAATATGGATTT 
TGGTCATTCTGAGGCTAATACCAATTATTTTAAAAAAGCTGTTGAGGATTATAGGCAAAACCAATTAAAATTTATA 
GGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCAGAAGTTGCTATTAGGTCTAAACAAAGCGAAAAAAAGGCTT 
TTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTGTGGGTATTTGGCCGGATATTTC-CTTAATGA 
TATTGTGGCAGATAGTTTTGATAGATTTAGATTCGGTTTTTATAAAAGAGGCAATTTTATTTATGTGGATCCCAAC 
AATATAGCAGTTAATCCTTTTGAAGAATATAATGAAACCAGCAGGGTTAGTTCTAAATTTTTGAATGTTCTTAAAG 
ATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTGAAGTGTCGGTTTACACTATTGATAGAATACTTTT 
GTCCGAAATGGGAGAAC-ATTGTTATTATGCAATGTTGCCCATAAGTAGTAAATTGGGAGAAAAGAGTGGAGTACTT 
ATTGCTAGC-CTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATTTCAGTATATTTTATATTCAGTCT 
TAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATTATTAGTTTTCGTTTAAACGCAATTAG 
AGTTCTAGTTCAAGATATGGTTAAGGGCAATTTAGATAAAGATTATGCTCTTGATGATGATGAAAATACTCTTGAT 
GAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATGAAAAAAGCTATTTCTGTAGCAATTTCTAGTGTTTTGAGAA 
ATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTTAAGCTCTAGTGCATTGCAACA 
GGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGCAAATAGCCTCAGGTGTCAACATGAGCGCCAATAATTCT 
TATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGTGGTAGGGCCGTTGAAGAATCTGTTA 
TTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGCTAGAAAAACCAATTTACTTGCTTTGAA 
TGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTTGCTGTTGTGGCCAGTGAGATTAGAAAGTTG 
GCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGTAGCAACTGAAGCGG 
GAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGATTTCAGAAGGTAGCTC 
TAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTCAATCTTCAGCTTCA 
AGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAACTTAGAAAATCTGTATTATTTT 
TCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATAGATTGTCCTGAAAA 
TTCTTTTAAAGATGAAAATCAAAATTTGAAAAGCAATGGAATTTCTACTTCAAATGCCAGTGGGCATAATAATTAT 
TCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAAAGCTATCGATATTG 
CTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

f200 .aa 

MTISKNVFSKFILKFLNSSAFVSVFALFVGFLIVGLVVMGLGHSPFRMYFIILEIIFSSPKHLGYVLSYSAPLIFT 

GLSIGISLKAGLFNIGVEGQFILGSIVALIASVLLDLPPILHVITIFIITFLASGSLGILIGYLKAKFNISEVISG 

IMFNWILFHLNWIILDFSFIKRDNSDFSKPIKESAYIDFLASWKLSPHGLAYRSSHPFVKELLKAPLKFGIILGII 

FAILIWFLLNKTIIGFKINATGSNIEASRCMGINVKAVLIFSMFLSAAVAGLAGAIQLMGVNKAIFKLSYMQGIGF 

NGIAASLMGNUSPIGIIFSSILFSILLYGSSRVQSLMGLPSSIVSLMMGIIVLVISASYFLNKIVLKGVKRVKYNN 
ILD 



t2 0 0.aa 

LWMGLGHSPFRMYFIILEIIFSSPKHLGYVLSYSAPLIFTGLSIGISLKAGLFNIGVEGQFILGSIVALIASVLL 
DLPPILHVITIFIITFLASGSLGILIGYLKAKFNISEVISGIMFNWILFKLNNIILDFSFIKRDNSDFSKPIKESA 
YIDFLASWKLSPEGLAYRSSHPFWELLKAPLHFGIILGIIFAILIWFLLNKTIIGFKINATGSNIEASRCMGIMV 





TABLE 1. Nucleotide and Amino Acid Sequences 

KAVLIFSMFLSAA.VAGLAGAIQLMGVNKAIFKLSYMQGIGFNGIAASLMGNNSPIGIIFSSILFSILLYGSSRVQS 
LMGL P SS I VSLMMGI IVLVI SAS YFLNK I VLKGVKRVKYNNI LD 



ATGACAATTAGTAAAAACGTATTTAGTAAATTTATTTTGAAATTTTTAAATTCTTCAGCATTTGTTAGTGTATTTG 
CTCTATTTGTTGGATTTTTAATTGTTGGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTAT 
AATATTAGAAATTATTTTTTCTTCTCCCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTGATTTTTACA 
GGTCTTTCTATTGGTATTTCTTTAAAAGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTA 
TTGTTGCTTTAATAGCATCAGTTTTACTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTT 
TTTAGCTTCAGGCAGTTTAGGAATTTTAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGA 
ATAATGTTTAATTGGATATTATTTCATTTAAATAATATAATTTTAGATTTTAGTTTTATTAAAAGAGATAATAGTG 
ATTTTTCAAAACCCATTAAAGAAAGCGCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGC 
TTATAGATCTTCTCATCCTTTTGTTAATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATT 
TTTGCTATTTTAATATGGTTTTTACTTAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTG 
AAGCTTCAAGATGTATGGGTATTAATGTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGG 
TCTTGCTGGTGCTATTCAACTTATGGGTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTT 
AATGGGATAGCTGCTTCTCTTATGGGAAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATAT 
TGCTTTATGGAAGCAGTAGAGTTCAAAGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAAT 
TGTTCTTGTAATTTCTGCTAGCTATTTTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATATAATAAT 
ATTC TTG ATTAG 



GGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTATAATATTAGAAATTATTTTTTCTTCTC 
CCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTGATTTTTACAGGTCTTTCTATTGGTATTTCTTTAAA 
AGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTATTGTTGCTTTAATAGCATCAGTTTTA 
CTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTTTTTAGCTTCAGGCAGTTTAGGAATTT 
TAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGAATAATGTTTAATTGGATATTATTTCA 
TTTAAATAATATAATTTTAGATTTTAGTTTTATTAAAAGAGATAATAGTGATTTTTCAAAACCCATTAAAGAAAGC 
GCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGCTTATAGATCTTCTCATCCTTTTGTTA 
ATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATTTTTGCTATTTTAATATGGTTTTTACT 
TAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTGAAGCTTCAAGATGTATGGGTATTAAT 
GTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGGTCTTGCTGGTGCTATTCAACTTATGG 
GTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTTAATGGGATAGCTGCTTCTCTTATGGG 
AAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATATTGCTTTATGGAAGCAGTAGAGTTCAA 
AGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAATTGTTCTTGTAATTTCTGCTAGCTATT 
TTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATATAATAATATTCTTGATTAG 



MVKKFSIFLKAIIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVRNNLF 
FDIIHCLIPLAFYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEKNLILIAFARTFSMSL 
LIPFTFFIIISSSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVI 
EMEKTKFYIDKYLQRKSDSILGIFLFTLFASFTIFLMNFYKFFKASFLNPIILMTKILQDPLEYRKIQIPFTLSEE 
KVYELAKSFNNLLLKEKLNSKRKSKIPLEIEKVKKIINKNQEIK 



IIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVP2TNLFFDIIKCLIPLA 
FYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEXNLILIAFARTFSMSLLIPFTFFIIIS 
SSKIVNSIPSKQEFN-IIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVIEMEKTKFYIDK 
YLQRKSDSILGIFLFTLFASFTIFLidNFYKFFKASFLNPIILiMTKILQDPLEYRKIQIPFTLSEEKVYELAKSFTNTN 
LLLKEKLNSKRKSKIPLEIEKVKKIINKNQSIK 



f200.nt 



■ t200 .nt 



f208 .aa 



t2 0 8 .aa 



f 2 08 .nt 
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ATGGTAAAAAAATTTTCAATTTTCTTAAAAGCAATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCT 
CAATAATTCTTTTTTTACCATACAAAATACGATTTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTAT 
TTTCATTTTTTTATACAAAATAACCAAGGCCTACCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTC 
TTCGATATAATCCACTGCCTTATTCCTTTAGCGTTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATG 
AAACAATATTAAATCCAATAATGCTATCACTTTTCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAAT 
AATAGAAATATATTACAATTCAAAAGAAAAGAACCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTA 
TTAATACCATTTACATTTTTTATAATAATATCAAGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTA 
ATATCATTAAAAATATATCAATAATAAATGAAAAAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAA 
GGAAAAAGATGACATAATATACTCAAAATCAGACGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATA 
GAAATGGAGAAAACAAAATTTTATATAGATAAATATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTAT 
TTACATTGTTTGCATCATTTACTATTTTTTTAATGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTAT 
TATTTTAATGACAAAAATTTTACAAGACCCATTAGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAA 
AAAGTATATGAACTTGCAAAATCATTTAACAATCTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAAGCAAAA 
T - CCTTTAGAAATTGAAAAAGTAAAAAAAATAATTAATAAAAACC AGGAAATAAAATGA 



ATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCTCAATAATTCTTTTTTTACCATACAAAATACGAT 
TTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTATTTTCATTTTTTTATACAAAATAACCAAGGCCTA 
CCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTCTTCGATATAATCCACTGCCTTATTCCTTTAGCG 
TTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATGAAACAATATTAAATCCAATAATGCTATCACTTT 
TCAAGTTAAGA'TTTTTAAGACTTCTTAGGTTTAATGACCTAATAATAGAAATATATTACAATTCAAAAGAAAAGAA 
CCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTATTAATACCATTTACATTTTTTATAATAATATCA 
AGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTAATATCATTAAAAATATATCAATAATAAA'TGAAA 
AAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAAGGAAAAAGATGACATAATATACTCAAAATCAGA 
CGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATAGAAATGGAGAAAACAAAATTTTATATAGATAAA 
TATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTATTTACATTGTTTGCATCATTTACTATTTTTTTAA 
TGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTATTATTTTAATGACAAAAATTTTACAAGACCCATT 
AGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAAAAAGTATATGAACTTGCAAAATCATTTAACAAT 
CTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAAGCAAAATACCTTTAGAAATTGAAAAAGTAAAAAAAATAA 
TTAATAAAAACCAGGAAATAAAATGA 



MKIQIIIMLLALLDFPLNARLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEISKNIYKLTEHFVKSIMLNK 
TMYSLLNSNYKEANKYLIQSELIDKKFLKYKIFKIKNINGIFKSHSLIYTKKGFYKLELYIENNAEPLKIFNLNIT 
YFLKNLDKISNEMIFFPREKREVNMIQKTTIAADSSSKPRGINYDTGIPFNVLIVDDSVFTVKQLTQIFTSEGFNI 
IDTAADGEEAVIKYKiSrHYPNIDIVTLDITMPKMDGITCLSNIMEFDKNARVIMISALGKEQLVKDCLIKGAKTFIA/ 
KPLDRAKVLQRVMSVFVK 



RLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEIEKNIYKLTEHFVKSIMLNKTJSTYSLLNSNYKEANKYLIQ 
SELIDKKFLKYKIFKIKNINGIFKSHSLIYTKKGFYKLELYIENNAEPLKIFNLNITYFLKNLDKISNEMIFFPRE 
KREVNMIQKTTIAADSSSKPRGINYDTGIPFWLIVDDSVFTVKQLTQIFTSEGFNIIDTAADGEFAVIKYKNHYP 
NIDIVTLDITMPKMDGITCLSNIMEFDKNARVIMISALGKEQLVKDCLIKGAXTFIVKPLDRAKVLQR^/MSVFVK 



ATGAAAATTCAAATAATTATAATGCTGCTTGCATTGTTAGATTTTCCACTTAATGCCAGACTTTTGGACATTTCAA 
TTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCTTATAATTTAATTTTAGAAAAAGAATACTATACCAA 
TTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACTAACAGAACATTTTGTAAAAAGCATAATGCTCAATAAA 
ACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCAAATAAATATCTAATTCAJ\AGCGAACTCATTGATAAAA 
AATTTTTAAAATATAAAATATTTAAAATCAAAAATATAAATGGAATTTTTAAAAGCCATTCACTAATATATACAAA 



t2 08 .nt 



f210.aa 



t210 .aa 



f210 .nt 
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AAAAGGATTTTACAAATTAGAACTTTACATAGAAAATAATGCAGAACCTCTAAAAATATTTAACCTTAACATTACT 
TATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAATGA 

t210 .nt 

AGACTTTTGGACATTTCAATTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCTTATAATTTAATTTTAG 
AAAAAGAATACTATACCAATTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACTAACAGAACATTTTGTAAA 
AAGCATAATGCTCAATAAAACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCAAATAAATATCTAATTCAA 
AGCGAACTCATTGATAAAAAATTTTTAAAATATAAAATATTTAAAATCAAAAATATAAATGGAATTTTTAAAAGCC 
ATTCACTAATATATACAAAAAAAGGATTTTACAAATTAGAACTTTACATAGAAAATAATGCAGAACCTCTAAAAAT 
ATTTAACCTTAACATTACTTATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAA 
TGA 

f 22 ,aa 

MLKTLTKIITISCLIVGCASLPYTPPKQNLMYLMELLPGANLYAHVNLIKMRSIYNSLSPKYKSVLGLISNLYFSY 
KKEKWDFALLIMGNFPKDIFWGIHKNP^TESIGNIFTNPKWK^ 

KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22 .aa 

P YTP PKQNLNYLMEL LPGANL YAHVNL IKNRS I YNS LS PKYKSVLGL I SNLYF S YKKENNDFALL IMGNF PKD IFW 
GIHKNRNTES IGNIFTNPKWKLKNSNI YI I PNKARTSI AITQKDITAKDMNMLTTKYIGEIEKNEMFFWIQDPTLL 
LPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIKDQNT 
VEIEFNIQKSSVESLIEKLASNIQT 

f22.nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 
CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAAAA 
CAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTTTAGCTAT 
AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGGGGAATTCATAAAA 
ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 
TC C AAAC AAAGC T AG AAC TAGC ATTGC AAT AAC C C AAAAAGAT AT AAC C GC AAAAGAC AAT AAT ATGCT AAC AAC A 
AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTC AAGATC C AACATTAT TGC TC CCAAAC CAAA 
TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 
TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAATTCCAACCGTCTTG 
ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TT AAT ATTC AAAAAT C TAGTGTTG AAAGC C TT AT AGAAAAAC T AGCTTC AAAT AT TC AAAC CT AA 

t.22 .nt 

CCTTACACTCCTCCAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAA 
ATTTAATTAAAAACAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTT 
ATACTTTAGCTATAAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGG 
GGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAA 
ATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAA 
TATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTA.TTG 
CTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATC 
AAGAAGAATATATTTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAAT 
TCCAACCGTCTTGACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACG 
GTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGCCTTA-TAGAAAAACTAGCTTCAAATATTCAAACCT 
AA 



f221 .aa 
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MGITVFYLFSIFASFVLGSSMDSVKENVLKSTIFYYDVEEVEFPYARKQTLQFIAKTHLKYAVFNFDKjSTKMFSYTF 
VFDKKL I SQ YAIF I EVKKKFGEATL VTPLNYLWDLGDS I I VLNKNILRITLKSY I SNYNK 

t221.aa 

SMDS VKENVLKST I F YYDVEEVEF P YARKQTLQF I AKTHLKYAVFNFDKNKMFS YTFVFDKKL I S Q YAIF I EVKKK 
FGEATLVTPLNYLWDLGDSIIVLNKNILRITLKSYIS3STYNK 

f221.nt 

ATGGGTATTACAGTTTTTTATTTATTTTCTATTTTTGCATCTTTTGTTCTGGGTTCTAGCATGGATTCTGTTAAAG 
AGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTTATGCTAGGAAGCAGACTTT 
ACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAAAATGTTTTCGTACACTTTT 
GTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAGTTTGGCGAGGCTACACTAG 
TAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAAATATTTTAAGAATTACTTT 
AAAATCTTATATTTCAAATTATAATAAATGA 

t221.nt 

AGCATGGATTCTGTTAAAGAGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTT 
ATGCTAGGAAGCAGACTTTACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAA 
AATGTTTTCGTACACTTTTGTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAG 
TTTGGCGAGGCTACACTAGTAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAA 
ATATTTTAAGAATTACTTTAAAATCTTATATTTCAAATTATAATAAATGA 

f253 .aa 

MYMENISVRGQPNFFGLIPFFVFIIIYLGTGIYLGVIGVEMAFYQLPASVAMFFASIVCFLVFKGKFSDKIHIFIK 
GAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFSAGTSVGSIVAIAPIAF 
NIAVKSGINPNLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSAILTFFSFFFLSENLSN 
ATNFLHESSIDLVKTVPYLMIIFFSLAGMWVFIVLFLGILSICLISVLYGNLYFLDVMKNINKGFLIMADLIFLSI 
LTGGVSFAVIHNGGFKWLLIKLKSLIRGKSSAEFSIGAFVSIVDVFLANNTIAILICGKVAKKIAFENNISVQRSA 
SILDMFSCIFQGIIPYGAQMIILVNFSNGLVSPISILPFLVYFGFLLFFVILSILGLDIKKVFLFFLKK 

t253 .aa 

LVFKGKFSDKIHIFIKGAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFS 
AGTSVGSIVAIAPIAFNIAVKSGINPNLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSA 
ILTFFSFFFLSENLSNATNFLHESSIDLVKTVPYLMIIFFSl^GMWFIVLFLGILSICLISVLYGNLYFLDVMKN 
INKGFLMXIADLIFLSILTGGVSFAVTHNGGFKWLLIKLKSLIRGKSSAEFSIGAFVSIVDVFLANNTIAILICGKV 
AKKIAFFJSENISVQRSASILDMFSCIFQGIIPYGAQMIILVNFSNGLVSPISILPFLVYFGFLLFFVILSILGLDIK 
KVFLFFLKK 

f253 .nt 

ATGTATATGGAAAATATTGAAGTAAGAGGGCAGCCAAATTTTTTTGGGCTTATTCCTTTTTTTGTTTTTATTATTA 
TCTATTTAGGCACGGGGATTTATTTGGGAGTTA'TTGGTGTAGAAATGGCCTTTTATCAACTGCCGGCTAGTGTTGC 
AATGTTTTTTGCTTCCATTGTTTGTTTTTTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAA 
GGAGCAGCTCAGTACGATATTA.TACTAATGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAG 
AAATAGGCTGCGTTGAAACTGTAGCAAATTTGGGAATTAAATATA.TTAATCCTAATTGGATTGTTTCTGGTATATT 
TTTTGTAACCTGCTTTCTTTCTTTTTCTGCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTT 
AATATTGCTGTTAAAAGCGGCATTAATCCGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATA 
ATCTTTCTTTAATATCAGATACAACTATTGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTTTTATTAG 
TAGCAGTTTTTATGCTTTTCCATCCGCCATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTGTCCAAT 
GCCACAAACTTTTTACACGAAAGTTCAATAGATTTAGTGAAAACTGTGCCTTATTTAATGATTATATTTTTCTCTT 
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TAGCTGGAATGAATGTTTTTATAGTTCTTTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAA 
TTTATACTTTCTAGATGTAATGAAAAACATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATT 
TTAACAGGGGGAGTTTCTTTTGCCGTGATTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTGA 
TTAGAGGAAAAAGTTCAGCGGAATTTTCTATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACAC 
AATTGCCATACTTATTTGCGGCAAAGTAGCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCT 
TCTATTTTAGATATGTTCTCTTGTATTTTTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATT 
TTTCAAATGGACTTGTGTCGCCAATTAGTATTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTAT 
TTTATCTATTTTGGGCCTTGATATAAAAAAAGTTTTTTTATTTTTTTTAAAAAAATAA 

t253 .nt 

TTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAAGGAGCAGCTCAGTACGATATTATACTAA 
TGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAGAAATAGGCTGCGTTGAAACTGTAGCAAA 
TTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTGGTATATTTTTTGTAACCTGCTTTCTTTCTTTTTCT 
GCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTTAATATTGCTGTTAAAAGCGGCATTAATC 
CGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATAATCTTTCTTTAATATCAGATACAACTAT 
TGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTTTTATTAGTAGCAGTTTTTATGCTTTTCCATCCGCC 
ATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTGTCCAATGCCACAAACTTTTTACACGAAAGTTCAA 
TAGATTTAGTGAAAACTGTGCCTTATTTAATGATTATATTTTTCTCTTTAGCTGGAATGAATGTTTTTATAGTTCT 
TTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAATTTATACTTTCTAGATGTAATGAAAAAC 
ATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATTTTAACAGGGGGAGTTTCTTTTGCCGTGA 
TTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTGATTAGAGGAAAAAGTTCAGCGGAATTTTC 
TATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACACAATTGCCATACTTATTTGCGGCAAAGTA 
GCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCTTCTATTTTAGATATGTTCTCTTGTATTT 
TTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATTTTTCAAATGGACTTGTGTCGCCAATTAG 
TATTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTATTTTATCTATTTTGGGCCTTGATATAAAA 
AAAGTTTTTTTATTTTTTTTAAAAAAATAA 

f 265 .aa 

l^KCFVSLSLLLIFFACSSWEIELNDDISGIVSIFVNVWREFEKIRKELLTTLVGEEIANMPLFPVDEIKKYFKN 

GEEKLGLKLLSIKTQGDSINLWKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINLENATKNINENKEYIS 

DALAALLPSDEIPMSAKEYKDVLVYFLSDFTSKASELIDNSKLNLWKTSFJWQEQFGFKQINSNTLRFEMDMVKG 

LSLETPIKLRLV 

Y 

t265 .aa 

SWEIELNDDISGIVSIFVWWREFEKIRKELLTTLVGEEIANMPLFPVDEIKKYFKNGEEKLGLKLLSIKTQGDS 
INLWKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINLENATKNINENKEYISDALAALLPSDEIPMSAKE 
YKD^/LVYFLSDFTSKASELIDNSKLNLVVKTSPJWQEQFGFKQINSNTLRFEMDMVKGLSLETPIKLRLVY 

f265.nt 

ATGAGAAAGTGTTTTGTTAGCTTGAGTTTATTGTTGATTTTTTTTGCTTGTAGCTCTAAT&TTGAAATTGAGTTAA 
ATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTGAAAAAATTAGAAAAGAACTCTT 
AACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTCCTGTAGATGAAATAAAAAAATACTTTAAAAAT 
GGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCTATTAATTTAGTTGTTAAGTTTG 
ATAATTTAATTAAAATTTT AGGC GATT AT ATGAAAAAACC CGATAT ATC TGTGTTTAAGATAGAAAAAAAAG ATGG 
TAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAATATTAATGAAAATAAAGAATATATTAGT 
GATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAATATAAAGATGTTTTGGTTTATT 
TTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTAATCTTGTAGTTAAGACTTCTAG 
AAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACACACTGCGGTTTGAGATGGATATGGTTAAAGGA 
TTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 



t265 .nt 
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■ TCTAATGTTGAAATTGAGTTAAATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTG 
AAAAAATTAGAAAAGAACTCTTAACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTCCTGTAGATGA 
AATAAAAAAATACTTTAAAAATGGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCT 
ATTAATTTAGTTGTTAAGTTTGATAATTTAATTAAAATTTTAGGCGATTATATGAAAAAACCCGATATATCTGTGT 
TTAAGATAGAAAAAAAAGATGGTAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAATATTAA 
TGAAAATAAAGAATATATTAGTGATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAA 
TATAAAGATGTTTTGGTTTATTTTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTA 
ATCTTGTAGTTAAGACTTCTAGAAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACACACTGCGGTT 
TGAGATGGATATGGTTAAAGGATTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 

f 269 .aa 

MNIRKLLFCIFFMNISFLLFAGDYKGLDFKIKFFNQSIYRVNSNVFIEVSLSNASESVLTLEIGDINSFGFDFDVT 
DTTNIKVKRPIEYVKKRSKNVAIPVRNMSLRPNEKFSWINLNQFVKFSKDGVYFVKGIFFPDISDPSKKKESNII 
TLFLNDGFDENPGSIDLWLSENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLY 
KQKLSPIPNKNWEEYKEYLWNSNNSDISKAPNKFSIIETTYSDTSGKVIADLYFDDGQFYISKRYTFFFKKYDYY 
WIIYDYIVQNTGIKEK 

t269 .aa 

GDYKGLDFKIKFFNQSIYRWSNVFIEVSLSNASESVLTLEIGDINSFGFDFDVTDTTNIKVKRPIEYVKKRSKNV 
AIPVRNMSLRPKTEKFSWINLNQFVKFSKDGVYFVKGIFFPDISDPSKKKESNIITLFLNDGFDENPGSIDLVNLS 
ENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLYKQKLSPIPNKNWEEYKEYLW 
NSNNSDISKAPNKFSIIETTYSDTSGKVIADLYFDDGQFYISKRYTFFFKKYDYYWIIYDYIVQNTGIKEK 

f269.nt 

ATGAATATTAGAAAATTGCTTTTTTGTATCTTTTTTATGAATATTTCTTTTCTTTTGTTTGCGGGAGATTACAAGG 
GCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATCGTGTCAATAGTAATGTTTTTATTGAAGTTTCTCT 
TAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGATTTTGATGTTACT 
GATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATGTTGCAATTCCTGTTA 
GAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAACTTAAATCAATTTGTTAAGTTTAGTAAAGA 
TGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGAATCCAATATTATT 
ACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCTGAAAATAATGATA 
TTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGGCATTGCAGCTTGG 
GAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACAAGGGCAAGGCATACCTTTAT 
AAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTGAAGAGTATAAAGAATATTTGTGGAATTCTAATAATT 
CGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATACTTCTGGCAAGGTGATTGC 
TGATTTATATTTTGACGATGGGCAATTTTATATTTCCAAAAGATATACTTTCTTCTTTAAAAAATATGATTATTAT 
TGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 

t269 .nt 

GGAGATTACAAGGGCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATCGTGTCAATAGTAATGTTTTTA 
TTGAAGTTTCTCTTAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGA 
TTTTGATGTTACTGATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATGTT 
GCAATTCCTGTTAGAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAAGTTAAATCAATTTGTTA 
AGTTTAGTAAAGATGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGA 
ATCCAATATTATTACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCT 
GAAAATAATGATATTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGG 
CATTGCAGCTTGGGAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACAAGGGCAA 
GGCATACCTTTATAAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTGAAGAGTATAAAGAATATTTGTGG 
AATTCTAATAATTCGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATACTTCTG 
GCAAGGTGATTGCTGATTTATATTTTGACGATGGGCAATTTTATATTTCCAAAAGATATACTTTCTTCTTTAAAAA 
ATATGATTATTATTGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 
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f29.aa 

MNWLSFFYVLLFLLIFPFELQSNNKENISNLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKY 
KEANDFLKKINQKKIKNQKIKNEIISLKLRINEDNINEEEIKKILNNEKNIDVKIIYQIFSLIKFKNKKLANKIKN 
IILTNYPKSIYSYKIKRNE 



t29 .aa 

NNKENIENLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKYKEANDFLKKINQKKIKNQKIKN 
EIISLKLRINEDNINEEEIKKILNNEKNIDVKIIYQIFSLIKFKNKKLANKIKNIILTNYPKSIYSYKIKRNE 

f29 .nt 

ATGAACTGGCTATCCTTTTTTTATGTTTTATTATTTTTATTAATTTTTCCTTTTGAATTACAGAGTAATAATAAAG 
AAAATATAGAAAATTTAATAAAGCTACATATGCTTTATGATTTAACCAATAACCTGTCAAAAGAATTAGAAACAAT 
AAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAAAATAAAAAAATAT 
AAAGAAGCTAATGATTTTTTAAAAAAAATAAACCAAAAAAAGATCAAAAATCAAAAAATAAAAAACGAAATCATTT 
CGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTTAAATAACGAAAAAAATAT 
AGATGTCAAAATAATTTATCAAATATTCAGTCTTATAAAATTTAAAAATAAAAAATTAGCAAATAAAATTAAAAAC 
ATAATACTAACAAACTATCCCAAAAGCATTTATTCTTATAAAATAAAAAGAAATGAATAA 

t2 9 .nt 

AATAATAAAGAAAATATAGAAAATTTAATAAAGCTACATATGCTTTATGATTTAACCAATAACCTGTCAAAAGAAT 
TAGAAACAATAAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAAAAT 
AAAAAAATATAAAGAAGCTAATGATTTTTTAAAAAAAATAAACCAAAAAAAGATCAAAAATCAAAAAATAAAAAAC 
GAAATCATTTCGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTTAAATAACG 
AAAAAAATATAGATGTCAAAATAATTTATCAAATATTCAGTCTTATAAAATTTAAAAATAAAAAATTAGCAAATAA 
' AATTAAAAACATAATACTAACAAACTATCCCAAAAGCATTTATTCTTATAAAATAAAAAGAAATGAATAA 

f290 .aa 

MNS I YVIGKLLLTLFL I FFPFCYNLFAVWLAEINKLiSE YAKS I VLIDFDTKRILYSKKPNLVFPPASLTKIVTIYT 
ALIEAEKRNIKLKSIVPISDSASYYNAPPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSSIA.IAEFWGNLNSFVN 
LMNINVLNLGLFNMHFVEPSGYSSENKITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFLNLK 
QRNANLLIYDYPYSDGIKTGYIKESGLNLVATAKKGERRLIAVVLGVEKGINGFGEKMRSSIAKNLFEYGFNKYSK 
FPLIVKLKEKVYNGTVDTVALFSKEPFYYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAMIFLENEKIGDVALF 
S GKVKRLGFWQGL YKS F INLF SRE Y 

t290 .aa 

VNLAEINKLSEYAKSIVLIDFDTKRILYSKKPNLVFPPASLTKIVTIYTALIEAEKRNIKLKSIVPISDSASYYNA 
PPNSSLMFLEKGQIWFEEILKGLSVSSGNDSSIAIAEFWGNLNSFWLMNINVLNLGLFI^HFVEPSGYSSENK 
ITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFLNLKQRNANLLIYDYPYSDGIKTGYIKESGL 
NLVATAKKGERRLIAVVLGVEKGINGFGEKMRSSIAKNLFEYGFNKYSKFPLIVKLKEKVYNGTVDTVALFSKEPF 
YYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAMIFLENEKIGDVALFSGKVKRLGFWQGLYKSFINLFSREY 

f2?0.nt 

ATGAATAGTATCTATGTTATTGGGAAATTGTTATTAACTTTATTTTTAATTTTTTTCCCGTTTTGTTATAATCTTT 
TTGCAGTTAATTTAGCTGAGATTAATAAATTATCAGAGTATGCAAAGTCAATAGTTTTAATAGATTTTGATACTAA 
GCGAATACTTTATTCTAAGAAGCCCAATTTGGTTTTTCCTCCAGCATCTCTTACAAAGATTGTTACAATTTATACA 
GCTTTAATTGAAGCTGAAAAGCGAAA.TATAAAATTAAAAAGCATAGTTCCTATTAGCGATTCTGCTTCATATTATA 
ATGCACCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATTTTAAAAGGACT 
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TTCAG7T~CTTCGGGTAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGCTTTGTTAA'r 
TTA-ATGAATATTAATGTTTTAAATTTAGGGCTTTTTAATATGCATTTTGTTGAACCTTCTGGATATAGCAGCGAGA 
ATAA.GArCACAGCACTAGATATGGCTTTTTTTGTGAAATCTTATATAGAAAAGTTTAAATTTATGCTTAATATTCA 
TTCTTCAAAC-TATTTTATTTATCCAAAGAGTAGAAATTTAGGAACTGCTTTGTCATCAAAATTTTTAAACTTAAAA 
CAAAGAAA7C-CTAATTTATTAATATATC-ATTACCCTTATTCAGATGGCATTAAAACGGGATATATTAAGGAATCAG 
GCT'TAAArCTTGTTGCTACTGCTAAAAAGGGTGAGA.GAAGATTAATAGCAGTTGTATTGGGGGTTGAAAAAGGAAT 
TAATC^ArTTGGAGAGAAAATGA-GATCTTCGATTGCAAAAAATTTATTTGAATATGGATTTAATAAA-TATTCTAAA 
TTCCCCCCA-ATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAGTTGCTCTTTTTTCTAAAGAGC 
CTCTT^ArTATATTTTAACTAAAGATGAATTTGATAAAATTAATATAAGTTATACTGTTGATAAATTGGTTGCTCC 
'ACT-TAGTGGGGATATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATAGGGGATGTTGCTTTGTTT 
AGTGGCAAGGTAJIAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATCTTTTTTCAAGAGAGTATT 



GTTAA'rCCAGCTGAGATTAATAAATTATCAGAGTATGCAAAGTCAATAGTTTTAATAGATTTTGATACTAAGCGAA 
TACTTTATTCTAAGAAGCCCAATTTGGTTTTTCCTCCAGCATCTCTTACAAAGATTGTTACAATTTATACAGCTTT 
AATTGAAGC TGAAAAGCGAAAT ATAAAATTAAAAAGCAT AGTTC C TATTAGC GATTC TGC TTC AT ATT ATAATGC A 
CCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATTTTAAAAGGACTTTCAG 
TTTCT7CGGGTAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGCTTTGTTAATTTAAT 
GAATATTAATGTTTTAAATTTAGGGCTTTTTAATATGCATTTTGTTGAACCTTCTGGATATAGCAGCGAGAATAAG 
ATTACAGCACTAGATATGGCTTTTTTTGTGAAATCTTATATAGAAAAGTTTAAATTTATGCTTAATATTCATTCTT 
TAAJ^GTACTTTATTTATCCAAAGAGTAGAAATTTAGGAACTGCTTTGTCATCAAAATTTTTAAACTTAAAACAAAG 
AAATGCTAATTTATTAATATATGATTACCCTTATTCAGATGGCATTAAAACGGGATATATTAAGGAATCAGGCTTA 
AATCTTGCTGCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTTGTATTGGGGGTTGAAAAAGGAATTAATG 
GATTTGGAGAGAA-AATGAGATCTTCGATTGCAAAAAATTTATTTGAATATGGATTTAATAAATATTCTAAATTTCC 
TTTAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAGTTGCTCTTTTTTCTAAAGAGCCTTTT 
TATTATA-C'T'TTAACTAAAGATGAATTTGATAAAATTAATATAAGTTATACTGTTGATAAATTGGTTGCTCCACTTA 
GTGGGGArATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATAGGGGATGTTGCTTTGTTTAGTGG 
CAAGGTAAAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATCTTTTTTCAAGAGAGTATTAA 



i^SYDFITAiVPIILIllGLGIIKKPAYWIPISLIATVAIVIFYKNLGIVNTSLAMLEGALMGIWPIATVIIAAI 
FTYKMSEDQKDIETIKNILSNVSSDRRIIVLLVAWGFGNFLEGVAGYGTAVAIPVSILIAMGFEPFFACLICLIMN 
TSSTAYGSVGIPITSLAQATNLDVNIVSSEIAFQLILPTLTIPFVLVILTGGGIKGLKGVFLLTLLSGMSMAISQV 
FISKTLGPELPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIVTFIVLVSPLFNKIHEY 
LKTFQSTISIYPEANPLHFKWIISPGFLIILATTISYSIRGVPMLKQLKIFTLTLKKMALSSFIIICIVAISRLMT 
HSG^IPJDLAJMX-ISIITGKFGPLFSPLIGAIGTFLTGSDTVSWLFGPLQTQMAENIGANPYWLAAANTTGATGGKM 
ISPQNITIATTTAGLIGQEGKLLSKTIIYALYYILATGLLVYLV 

i:291.aa 

QKDIETIKNILSNVSSDRRIIVLLVAWGFGNFLEGVAGYGTAVAIPVSILIAMGFEPFFACLICLIMNTSSTAYGS 
VGZPITSLA.QATNLDVNIVSSEIAFQLILPTLTIPFVLVILTGGGIKGLKGVFLLTLLSGMSMAISQVFISKTLGP 
SLPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIVTFIVLVSPLFNKIHEYLKTFQSTI 
SIYPEAN?LHFKWIISPGFLIILATTISYSIRGVPMLKQLKIFTLTLKK3^LSSFIIICIVAISRLMTHSGMIRDL 
ANGISIIC'GKFGPLFSPLIGAIGTFLTGSDTVSNVLFGPLQTQMAENIGAJSrPY^ifLAAANTTGATGGKMISPQNITI 
ATTTAGL IGQEGKLLS KTI IYALYY I LATGLLVYLV 



ATGAATTCTTATGATTTTATAACAGCTTTGGTACCAATAATCCTAATAATTATTGGACTTGGCATAATAAAAAAGC 
CAGCTTACTATGTAATACCCATATGATTAATAGCCACCGTTGCTATAGTTATATTTTATAAAAACTTGGGAATAGT 
AAACACAAGTCTTGCAATGCTTGAGGGCGCGTTAATGGGGATATGGCCAATAGCAACTGTAATTATTGCTGCCATA 
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TTTAC AT ACAAAATGTC AGAAGATC AAAAAGATAT AGAAACTATT AAAAAT ATTTTATC AAAC GTATC TTC TGATA 
GAAGAATTATAGTATTACTAGTTGCATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGT 
TGCAATTCCTGTATCAATATTAATAGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAATGAAC 
ACCTCATCAACCGCCTACGGATCTGTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACA 
•TTGTTTCATCTGAGATTGCATTCCAACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGG 
AGGGGGCATTAAAGGATTAAAAGGAGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTA 
TTTATATCAAAAACTTTGGGTCCAGAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTT 
ATGCAAGGTTTTTTGGAAATAAAGAAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTAT 
TGCCTGCTCACCCTACATTTTAATAGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATAC 
CTAAAAACTTTTCAAAGCACTATTAGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGG 
GCTTCTTGATTATACTTGCAACAACAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCT AAAAAT ATT 
TACATTAACCTTGAAAAAAATGGCATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACA 
CATAGTGGAATGATAAGAGATCTTGCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCAC 
TAATTGGAGCTATTGGGACATTTTTAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACA 
Q AATGGCAGAAAATATTGGAGCAAATCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATG 
= g ATTTCTCCCCAAAACATCACAATAGCAACAACAACTGCTGGATTAATTGGACAAGAAGGCAAGCTTTTATCAAAAA 
fi CAATAATTTATGCTTTATACTACATTTTAGCAACAGGATTGCTAGTTTATTTAGTATAA 



t291 .nt 

C AAAAAGATAT AGAAACTATT AAAAAT ATTTTATCAAACGTATCTTCTGATAGAAGAATTATAGTATTACTAGTTG 
CATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGTTGCAATTCCTGTATCAATATTAAT 
AGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAATGAACACCTCATCAACCGCCTACGGATCT 
GTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACATTGTTTCATCTGAGATTGCATTCC 
AACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGGAGGGGGCATTAAAGGATTAAAAGG 
AGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTATTTATATCAAAAACTTTGGGTCCA 
GAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTTATGCAAGGTTTTTTGGAAATAAAG 
AAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTATTGCCTGCTCACCCTACATTTTAAT 
AGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATACCTAAAAACTTTTCAAAGCACTATT 
AGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGGGCTTCTTGATTATACTTGCAACAA 
CAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATTTACATTAACCTTGAAAAAAATGGC 
ATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACACATAGTGGAATGATAAGAGATCTT 
GCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCACTAATTGGAGCTATTGGGACATTTT 
TAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACAAATGGCAGAAAATATTGGAGCAAA 
TCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATGATTCTCCCCAAAACATCACAATAG 
CAACAACAACTGCTGGATTAATTGGACAAG 



f296 ,aa 

MPSPIRVFFLVLLFIFIFNPVLIAMLFILFPFILILFSFLGVFRIYFTRDYSYSRSREFEFYKLSFLLMAKLLSIL 
GTVTGEQLNYVNFIINSLNLSERGKSELYTIFHSAITKNl^ADKILYTLKLGYFQHKDLFIWLFATLKE 
KNLEAEKFISYVGVFLELESDGY5AYKDINIKIWPYSVLGLTYSASDDEVKKAYKSLVIKYHPDKFANDPVRQKD 
ANDKFIKIQDAYEKICKERNIR 

t296 .aa 

IYFTRDYSYSRSREFEFYKLSFLLMAKLLSILGTVTGEQLNYVNFIINSLNLSERGKSELYTI 

ILYTLKLGYFQHKDLFIWLFATLKEINRLSRYKNLEAEKFISYVGVFLELESDGYEAYKDINIKIWPYSVLGLTY 
SASDDEVKKAYKSLVIKYHPDKFANDPVRQKDAJJDKFIKIQDAYSKICKERNIR 

f296 .nt 

ATGCCAAGCCCAATTAGAGTGTTTTTTTTAGTGTTGTTGTTTATTTTTATTTTTAATCCCGTTTTAATAGCAATGC 
TTTTTATTTTATTTCCTTTTATTTTGATATTATTTAGTTTTTTAGGTGTTTTTAGAATATACTTTACAAGGGATTA 
CTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATTAATGGCTAAATTGCTATCTATTTTA 
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GGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATTTTATTATCAATTCTTTGAATTTGTCTGAACGTGGTAAAT 
CAGAATTGTATACCATTTTTCATTCTGCTATTACTAAAAATAATAATGCTGATAAAATTTTATATACCCTTAAGCT 
TGGTTATTTTCAGCACAAAGATCTTTTTATATGGCTTTTTGCCACTCTTAAAGAAATTAACAGGCTTTCTAGGTAT 
AAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGAATCTGATGGTTATGAAGCTT 
ATAAAGATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATATAGTGCTAGCGATGATGAGGT 
TAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATGATCCTGTAAGACAAAAAGAT 
GCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAAAGAAATATAAGGTAA 

t296.nt 

ATATACTTTACAAGGGATTACTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATTAATGG 
CTAAATTGCTATCTATTTTAGGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATTTTATTATCAATTCTTTGAA 
TTTGTCTGAACGTGGTAAATCAGAATTGTATACCATTTTTCATTCTGCTATTACTAAAAATAATAATGCTGATAAA 
ATTTTATATACCCTTAAGCTTGGTTATTTTCAGCACAAAGATCTTTTTATATGGCTTTTTGCCACTCTTAAAGAAA 
TTAACAGGCTTTCTAGGTATAAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGA 
ATCTGATGGTTATGAAGCTTATAAAGATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATAT 
AGTGCTAGCGATGATGAGGTTAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATG 
ATCCTGTAAGACAAAAAGATGCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAAAG 
AAATATAAGGTAA 

f3.aa 

MKKKNX.SIYMIMLISLLSCNTSDPNELTRKKMQDKNVKILGFLEKIQADNKEIVEKHIEKKEKQMVQAASVAPINV 
ESNFPYYLQEEIEIKEEELVPNTDEEKIKAEKAISDGSLEFAKLVDDENKLKNESAQLESSFNNVYKEILELADLIQ 
AEVHVAGRINSYIKKRKTTKEKEYKKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKA 
KETLKAAITERL^KRKNRPWWARRTHSNLAIQAKNEAEDALNQLSTSSFRILEAMKIKEDVKQLLEEVKSFLDSS 
KSKIFSSGDRLYDFLETSK 

t3 -aa 

NELTRKKMQDKNVKI LGFLEKI QADNKEI VEKHI EKKEKQMVQ AAS VAP INVESNF P YYLQEE IE I KEEELVPNTD 
EEKKAEKAISDGSLEFAKLVDDENKLKNESAQLESSFNN\AfKEILELADLIQAEVHVAGRIMSYIKKRKTTKEKEY 
KKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKAKETLKAAITERLMNKRKNRPWWAR 
RTHSNLAIQAKNEAEDALNQLSTSSFRILEAMKIKEDVKQLLEEVKSFLDSSKSKIFSSGDRLYDFLETSK 

f3 .nt 

ATGAAAAAAAAAAATTTATCAATTTACATGATAATGCTAATAAGTTTATTATCATGTAATACAAGTGACCCCAATG 
■ AATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAGGATTTTTAGAGAAAATTCAAGCAGATAATAA 
AGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATGGTGCAGGCTGCTTCTGTAGCACCTATTAATGTA 
GAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGATGAAG 
AAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAACTTAA 
AAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAATACAA 
GCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCACTAAAGAAAAAGAATATAAGA 
AGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAGGCGA 
TATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAAAGCC 
AAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCATGGTGGGCAAGAAGAA 
CACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTTTTAG 
GATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTCTTCA 
AAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

. t3.nt 

*AATGAATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAGGATTTTTAGAGAAAATTCAAGCAGATA 
ATAAAGAAATTGTTG AAAAAC AT ATAGAAAAAAAA G AAAAAC AAATGGTGC AGGCTGC TTC TGTAGCACC TATTAA 
TGTAGAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGAT 
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GAAGAAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAAC 
TTAAAAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAAT 
ACAAGCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCACTAAAGAAAAAGAATAT 
AAGAAGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAG 
GCGATATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAA 
AGCCAAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCATGGTGGGCAAGA 
AGAACACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTT 
TTAGGATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTC 
TTCAAAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

f30 .aa 

MNKKILTLLVLILSISSVLMLSKSITKKSKYKIIRDYFINSNYVLVKIENKDLKFTISKPIYDKKLNNYFFKGQTT 
SHFLISNNVDIAINTSPYEVKQNMFFPKGLYIYNKKM 

LIKNGKYKKNFKETRHPRTIIGTDKNNKHLFLVTIEGRGVNNSKGASLNEAIDFALSYGMTNAINLDGGGSSTLW 
KSNNAPYKLNFTANIFGQERPVPFHLGIKLPN 

t3 0.aa 

LSKSITKKSKYKIIRDYFINSNWLVKIENKDLKFTISKPIYDKKLNNYFFKGQTTSHFLISNNVDIAINTSPYEV 
KQNMFFPKGLYIYNKKMISKQIMSTYGEIVIKHNKIILNPKEDEIENCDYGFSGFFVLIKNGKYKKNFKETRHPRTI 
IGTDKNNKHLFLVTIEGRGVNNSKGASLNEAIDFALSYGMTNAINLDGGGSSTLWKSNNAPYKLNFTANIPGQER 
PVPFHLGIKLPN 

f 30 .nt 

ATGAATAAAAAAATATTAACACTGCTAGTATTGATTTTAAGTATTTCATCAGTACTAATGCTGTCCAAATCAATCA 
CCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTCTGGTGAAAATTGAAAATAA 
AGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTTTAAAGGCCAAACAACA 
AGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTTAAACAAAACATGTTTT 
TCCCAAAAGGACTATACATATATAATAAAAAAATGATTTCAAAACAAATAAATAACTACGGAGAGATTGTAATAAA 
GCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAGAAAACTGCGATTATGGATTTAGCGGATTTTTTGTT 
TTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACAAGGCACCCAAGAACAATAATAGGAACTGATAAAA 
ATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGGCCTCTCTTAATGAAGC 
TATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTCAAGCACTCTTGTTGTA 
AAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGGACAGGAAAGACCTGTCCCATTTCATT 
T AGGAAT AAAACTTCC TAATTG A 

t30 .nt 

CTGTCCAAATCAATCACCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTCTGG 
TGAAAATTGAAAATAAAGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTT 
TAAAGGCCAAACAACAAGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTT 
AAACAAAACATGTTTTTCCCAAAAGGACTATACATATATAATAAAAAAATGATTTCAJiAACAAATAAATAACTACG 
GAGAGATTGTAATAAAGCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAGAAAACTGCGATTATGGATT 
TAGCGGATTTTTTGTTTTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACAAGGCACCCAAGAACAATA 
ATAGGAACTGATAAAAATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGG 
CCTCTCTTAATGAAGCTATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTC 
AAGCACTCTTGTTGTAAAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGG^CAGGAAAGA 
- CCTGTCCCATTTCATTTAGGAATAAAACTTCCTAATTGA 

f308 .aa 

MQLLKNKYPFKPALLDLFLVYAIV^LASPFVNA/NSEFWWVDENKFYFWISRSFLIIFIIYFFKLTSSYDDFRVEFF 
IPKFKFIFLTOSVXIFIKTILIAMIVTFLIAFLLEYLLPESVLV^/YFQNNAGFNWKISSKKAFFLMTFTSFFTGAF 




TABLE 1. Nucleotide and Amino Acid Sequences 

EELFYRAFVITKFTQMGFPWATAI LS SMFFAYGHLYYGI LGFLVTFILGI FFAFTYLRYKNVYYVIFIHSFYNI I 
VSSLLLFLN 

t308 .aa 

NSEFWNVDENHFYFWISRSFLIIFIIYFFKLTSSYDDFRVEFFIPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFL 
LEYLLPESVLWYFQNNAGFNWKISSKKAFFLMTFTSFFTGAFEELFYRAFVITKFTQMGFPWATAILSSMFFAY 
GHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIHSFYNIIVSSLLLFLN 

■f308 .nt 

ATGCAATTGTTAAAAAATAAATATCCATTCAAGCGGGCTTTGCTTGATCTTTTTTTGGTCTATGCTATTGTTTATT 
TGGCATCTCCTTTTGTAAATGTTAATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAG 
ATCTTTTTTAATTATTTTTATAATTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTT 
ATTCCTAAATTTAAATTTATTTTTCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAG 
TCATTTTTTTAATAGCTTTTTTGCTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGC 
TGGATTTAATTGGAAGATTAGCAGTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTT 
GAAGAACTTTTTTACAGGGCTTTTGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTC 
TTAGTAGTATGTTTTTTGCTTATGGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGAT 
ATTTTTTGCTTTTACTTATTTAAGGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATT 
GTTAGCAGCTTGTTGCTTTTTTTGAATTAA 

t308 .nt 

AATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAGATCTTTTTTAATTATTTTTATAA 
TTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTTATTCCTAAATTTAAATTTATTTT 
TCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAGTCATTTTTTTAA.TAGCTTTTTTG 
CTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGCTGGATTTAATTGGAAGATTAGCA 
GTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTTGAAGAACTTTTTTACAGGGCTTT 
TGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTCTTAGTAGTATGTTTTTTGCTTAT 
GGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGATATTTTTTGCTTTTACTTATTTAA 
GGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATTGTTAGCAGCTTGTTGCTTTTTTT 
GAATTAA 

f31.aa 

MKKYLFFILFLISSNNLIVSYPLSFGGGFSYQFTNYTDKTGATKFAPNFTRADHGINLNLFFDANYVLFEMSYKEA 
FWTHNGRYFSLGLYGTYPMVFKEQVRMLFPLIGFKYAFDLSSNNFNLFFLSMGLAADLFIPDLDGLYIRPLFMLS 
ISPFSNYKNFSGLTTEIMLGFNIGWRFFN 

t31.aa 

IVSYPLSFGGGFSYQFTNYTDKTGAa?KFAPNFTPJffiHGINLNLFFDANYVLFEMSYKEAFWTHNGRYFSLGLYGT 
YPWFKEQVRMLFPLIGFKYAFDLSSl^FWLFFLSMGLAADLFIPDLDGLYIRPLFMLSISPFSNYKNFSGLTTEI 
MLGFNIGWRFFN 

f31.nt 

ATGAAGAAATATCTTTTTTTTATTTTATTTCTCATCTCTTCTAATAATTTAATTGTTTCTTATCCACTTTCTTTTG 
GTGGAGGTTTTTCTTATCAATTTACTAATTATACTGATAAAACAGGCGCCACTAAATTTGCTCCAAATTTTACCAG 
AGCAGATCATGGGATTAATTTGAATTTATTTTTTGATGCAAATTATGTACTTTTTGAAATGTCTTACAAAGAGGCT 
TTTGTTGTTACTCACAATGGGAGATATTTCTCGCTTGGGCTTTATGGAACATATCCAATGGTTTTCAAAGAGCAGG 
TTAGAATGCTTTTCCCATTAATTGGGTTTAAATATGCTTTTGATTTAAGCTCTAATAACTTCAATCTCTTTTTTTT 
AAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTGATGGTTTATATATTAGGCCTTTGTTTATGCTTTCT 
ATTTCTCCATTTTCTAATTATAAAAATTTTTCTGGGTTAACAACTGAGATTATGCTTGGATTTAATATCGGTTGGA 
GATTTTTCAATTAG 
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t31.nt 

ATTGTTTCTTATCCACTTTCTTTTGGTGGAGGTTTTTCTTATCAATTTACTAATC , ATACCGACAA^_-lC.-:: 
CTAAATTTGCTCCAAATTTTACCAGAGCAGATCATGGGATTAATTTGAATTTATCTTTTGATGCAAArrj 
TTTTGAAATGTCTTACAAAGAGGCTTTTGTTGTTACTCACAATGGGAGATA~TTCTCGC r ITGGGCT r ?TA" 
TATCCAATGGTTTTCAAAGAGCAGGTTAGAATGCTTTTCCCATTAATTGGGTTTAAArA'rGCT'TTCGAr'' 
CTAATAACTTCAATCTCTTTTTTTTAAGCATGGGGC TTGCTGCTGATCTTTCTACTCCCGATCTTGATGC 
TATTAGGCCTTTGTTTATGCTTTCTATTTCTCCATTTTCTAATTATAAAAACTTC'TCTGGGT^AACAAC' 
ATGCTTGGATTTAATATCGGTTGGAGATTTTTCAATTAG 



MKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNLQ 

iekidlsnsyefindivnisgktyllaqnkeeelevcelngkdwtlk^ 
nnrekifdlqgsdktppqatendkfyqisneenlitgnslkiwqmi^^ 
vtctggy^mdtkfkwsntnnyttpifiqdevgefssyfar^ 
pgayngsqlsktglndiipvsnntiyiltqgkglwklenrkltke 

f939 .aa 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGKrEKIDLS^rSYZFIZOI-^TZSGKTY 

LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVGRDGVKEAYIL^^ 

FYQISNEENLITGNSLKIWQMNNNTYTNIDYQQ 

PIFIQDEVGEFSSYFAREFNDAILIGSNNGFAEFTKNKEGIFALRAPSKSVZPGAYNGSQLSKjGLiroiZPVSNirr 

I YI LTQGKGLWKLENR 

KLTKE 

f939 .nt 

ATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACCTACTAATACCC'rTACTACTAGCA.TGCC 
CAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAACArTrCGGGCAGTCCAAG 
TCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCC^ 

ATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATACCCGGAAAAACCrArCTZT 
TAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATT<^^^ 
■ GCTAAAAGCATATAAATTCTTAAAATCCGTAGGAAGAGATGGCGTAAAAGAAGCATACATCTCAGCTArAGALTAAA 
AATAATCGTGAGAAAATTTTTGATCTACAAGGATCTGACAAAACACCACCACP> AGCT ACTG» AAATGACA AATTTT 
ATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTCACTCAAAATATGGCAAACGAATAACAATACA'TACAC 
AAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAACA^GCATCAC-GGGCTC^TCTGAAGrTTCA 
GTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTACTCAAATACAAArAJi-CCACACAACGCCAA 
TATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAGAAT'TTAATGATGCGArATTAArCGGAAG 
T AAT AATGGATTTGCAGAATTTAC AAAAAATAAAGAAGG AATTTTTGC CCTACGGGC AC C C TC AAAAT CTGrAGAA 
CCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTGTATCAAACAACACGATr'T 
ACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAAAATTAACTAAAGAATAA 

t939 .nt 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATA^ 
CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCAr^^ 

CAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAArACATCTGGAAAAACCTAr 

CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACA7CAAAATCTAAAA 

AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGGAAGAGATGGCGTAAAA 

TAAAAATAATCGTGAGAAAATTTTTGATCTACAAGGATCTGACA^ 

TTTTATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTCACTC 
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ACACAAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAACAAGCATTAGGGGCTCTTCTGAAGT 
TTTAGTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTACTCAAATACAAATAATTACACAACG 
CCAATATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAGAATTTAATGATGCGATATTAATCG 
GAAGTAATAATGGATTTGCAGAATTTACAAAAAATAAAGAAGGAATTTTTGCCCTACGGGCACCCTCAAAATCTGT 
AGAACCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTGTATCAAACAACACG 
ATTTACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAAAATTAACTAAAGAATAA 

f739 .aa 

MQSGLKIKLILFFCCFACSCDINYPEIKELDYKINYYFTENRLDYSMSFDFAIKVINSKDVFKLSIENKJNITNEF-Q 
VINNNYSSFFIDSSLGKDILYCKDLRFNFFDKTFEDFTSCVRLFDKGMRVYNRELVISLGMSKYDLDDVHNYVYKS 
KDMEMLWKLSNSKVFFVKTYKDKLHPVSSWRIDSIDILEIDKAFDNYISFYYVEKNSNLFFKVG 

t739.aa 

?3 CCFACSCDINYPEIXELDYKINYYFTENRLDYSMSFDFAIKVINSKDVFKLSIENKNTNEFIQVINWNYSSFFIDS 
"> SLGKDILYCKDLRFNFFDKTFEDFTSCVRLFDKGMRVYNRELVISLGMSKYDLDDVHNYVYKSKDMEMLNKLiSNSK 
?!!! VFFVKTYKDKLHPVSSWRIDSIDILEIDKAFDNYISFYYVEKNSNLFFKVG 

UJ 

p f739.nt 

„; ATGCAGAGCGGATTAAAAATTAAATTAATATTGTTTTTTTGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGG 
p ;| AGAT AAAAGAGC TTGATTAT AAGATAAATTATTATTTTACTGAAAATCGC TTAGATTAC TC TATGAGTTTTG ATTT 

J' TGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATTATCAATAGAGAATAAGAACACTAATGAGTTTATTCAA 
P| GTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCTAGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGA 
GGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCTCATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATA 
CAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATATGATTTAGATGATGTTCACAATTATGTATATAAGTCT 
:^ ! : AAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAAGTATTTTTTGTTAAAACTTATAAAGACAAACTACATC 
^ CGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTCTAGAGATTGATAAAGCATTTGATAA'TTACATAAGTTT 
. TTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGTTGGCTGA - 

t7 3 9."nt 

TGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGGAGATAAAAGAGCTTGATTATAAGATAAATTATTATTTTA 
CTGAAAATCGCTTAGATTACTCTATGAGTTTTGATTTTGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATT 
ATCAATAGAGAATAAGAACACTAATGAGTTTATTCAAGTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCT 
AGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGAGGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCT 
CATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATACAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATA 
TGATTTAGATGATGTTCACAATTATGTATATAAGTCTAAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAA 
GTATTTTTTGTTAAAACTTATAAAGACAAACTACATCCGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTC 
TAGAGATTGATAAAGCATTTGATAATTACATAAGTTTTTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGT 
TGGCTGA 

f742 .aa 

MNKKHTNFSVLLLLIFLLILSFGGFGYYIYQSKLNDKNREIMLNEVKNSVIDRNYKKAYSVAKLLQDKYPQNEDIA 
MLTNTLASIANSSPFESKDLQRDSANQILDKIKGQDNTKT^ENFDIAFNWRYIKDSTITENYSDRNDDVGIEDE 
DISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSVM)QKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTN 
LSKEKNSENILKTPDNSKYSNNNNTTSLKKISSNSQKESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRV 
TLGKNRLKELIKKGLSNKFQKVNELIENSKNKEASNLLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLE 
DLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQ 
AIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIXDLWEFLKNNPNDAQASKTL 
AQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPI 
DLENTKSRQQAIKDLNEFXKmTPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKWN 
PNDAQASXTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQAYENNGDLLK 
AENAYEKIIKLTNTQEDHYKLGIIRFKLKKYEHSIESFDQTIKLDPKHKKALHNKGIALMMLNKNKKAIESFEKAT 
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QIDKSryGTAYYQKGIAEEKNGDMOQAFASFiasrAYNLDKNPNYALKAGIVSNNLGNFKQSEEYLNFFNiyNrAKKPNEI 
AIYNLSIAKFENNKLEESLETINKAIDLNPEKSEYLYLKASINLKKE^TYQNAISLYSLVISKNPENTSAYINLAKA 
YEKSGNKSQAISTLEKIINKNNKLALNNLGILYKKEKNYQKAIEIFEKAIINSDIEAKYWLATTLIEINDNTxRAKD 
LLREYTKLKPNNPSALHALGI I E YNENNNDQTLREL I KKF PNYKKNENI KKIIGI 

t742 .aa 



KLNDKNREIMLNEVKNSVIDRNYKKAYSVAKLLQDKYPQNEDIAMLTNTLAEIANSSPFESKDLQRDSANQILDKI 
KGQD 

NTKTNVNENFDIAFNNRYIKDSTITENYSDRNDDVGIEDEDISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSV 
NDQKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTNLSKEKNSENILKTPDNSKYSNNNNTTSLKKISSNSQ 
KESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRVTLGKNRLKELIKKGLSNKFQKVNELIENSKNKE^SN 
LLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPN 
DAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKS 
KVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKD 
LNEFXKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQAK 
KIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLXNNPNDAQASKTLAQANKIQHLEDLKSKA/HSIKPIDLEN 
TKSRQQAIKDLNEFXKMNPNDAQASKTLAQAYENNGDLLKAENAYEKIIKLTNTQEDHYKLGIIRFKLKKYEHSIE 
SFDQTIKLDPKHKKALHNKGIALMMLNKNK^ 

DKNPNYALKAGIVSNNLGNFKQSEEYLNFFNANAKKPNEIAIYNLSIAKFENNKLEESLETINKAIDLNPEKSEYL 
YLKASINLKKENYQWAISLYSLVIEKNPENTSAYINLAKAYEKSGNKSQAISTLEKIINKNNKLALNNLGILYKKE 
■ KNYQKAIEIFEKAIINSDIEAKYNLATTLIEINDNTRAKDLLREYTKLKPNWPEALHALGIIEYNENI-JNDQTLREL 
I KKF PNYKKNENI KKI IGI 

f'742.nt 

ATGAATAAAAAACATACAAATTTTTCGGTATTATTGCTTTTAATTTTCTTACTTATCTTATCATTTGGGGGCTTTG 
GTTACTATATATATCAAAGCAAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAAT 
AGATCGAAACTATAAAAAAGCATATTCTGTTGCAAAACTTCTGCAAGACAAATACCCCCAAAATGAAGACATTGCA 
ATGCTTACAAATACACTAGCAGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTA 
ATCAAATCTTAGACAAGATCAAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAA 
TAATAGATACATTAAAGACAGCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATGAA 
GACATATCTGAATTTAAAAAAAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAA 
TAATACAATCTCCAAATCCGAAATTAAGTGTTAATGACCAAAAAAATTTATTTAATTTGGAAAAACTAAAAAAAAA 
TTTAAGTGGAAAATCAAATAGTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGCAAAACACAAAT 
TTATCCAAAGAAAAAAATTCGGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTA 
CATCTTTAAAAAAAATTTCTTCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAA 
AATTTATAGGCCATATAGCTACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTC 
ACACTTGGAAAAAACAGATTAAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAAGTAAATGAATTGA 
TTGAAAATTCAAAAAATAAAGAAGCTTCAAATTTACTATTAACCTTAATAAAAAAAGATATTGAACCAAATCTCAT 
TAATATACCAAAAGATCCTTACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAG 
GACCTTAAATCTAAAGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 
ATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA 
ACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAA 
GCCATTAAGGATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTA 
ATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATC 
ACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTA 
GCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAA 
ACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTA 
AAACTTTAGCTCAAGCTAATAAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGAT 
CTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAG 
GCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAA'T'AAAAC 
CCATTGATCTTGAAAACACAAAAT 

CACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTT 
AGCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAA 
AACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCCAGGCCTC 
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TAAAACTTTAGCTCAAGCTTATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAA 
CTCACAAATACCCAAGAAGATCACTATAAACTTGGAATCATTAGATTCAAGCTTAAAAAGTATGAACAC^CAATAG 
AATCATTTGATCAAACAATAAAACTCGACCCAAAACATAAAAAAGCACTTCATAACAAAGGAATAGCTTTAATGAT 
GCTAAATAAAAACAAAAAAGCAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTAC 
TACCAAAAAGGAATAGCAGAAGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATC 
TCGACAAAAACCCCAATTATGCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGA 
GTATTTAAATTTTTTTAATGCCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAATTT 
GAAAACAATAAACTTGAAGAATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATCCAGAAAAAAGTGAA r PATT' 
TATATTTAAAAGCATCTATAAATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTT^GTAATTGA 
AAAAAACCCTGAAAATACTTCAGCCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGAAATAAAAGTCAAGCA 
ATCTCAACTCTTGAAAAGATAATAAACAAAAATAATAAATTAGCCTTAAACAATCTTGGGATACTTTACAAAAAAG 
AAAAAAATTATCAAAAAGCAATTGAAATTTTTGAAAAAGCAATAATCAATTCAGATATTGAAGCAAAATA^AATC^ 
TGCAACCACTCTAATTGAAATTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCA 
AACAATCCAGAGGCCTTACATGCACTAGGAATAATAGAATATAATGAAAATAACAATGATCAAACACTAAGAC-AAC 
TTATAAAAAAATTTCCAAATTACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

t742 .nt 



AAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAATAGATCGAAACTATAAAAAAG 
CATATTCTGTTGCAAAACTTCTGCAAGACAAATACCCCCAAAATGAAGACATTGCAATGCTTACAAATA.CACTAGC 
AGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTAATCAAATCTTAGACAAGATC 
AAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAATAATAGATACATTAAAGACA 
GCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATGAAGACATATCTGAATTTAAAAA 
AAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAATAATACAATCTCCAAATCCG 
AAATTAAGTGTTAATGACCAAAAAAATTTATTTAATTTGGAAAAACTAAAAAAAAATTTAAGTGGAAAATCAAATA 
GTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGGAAAACACAAATTTATCCAAAGAAAAAAATTC 
GGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTACATCTTTAAAAAAAATTTCT 
TCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAAAATTTATAGGCCATATAGC^ 
ACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTCACACTTGGAAAAAACAGATT 
AAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAAGTAAATGAATTGATTGAAAATTCAAAAAATAAA 
GAAGCTTCAAATTTACTATTAACCTTAATAAAAAAAGATATTGAACCAAATCTCATTAATATACCAAAAGATCC^T 
ACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAGGACCTTAAATCTAAAGTTCA 
TTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTGAAA 
AACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAAT 
CTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGA 
ATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAG 
GACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 
ATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA 
ACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAA 
GCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCAAGCTAAT 
AAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACG 
CCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCA 
AGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACA 
AAATCACG-CCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAA 
CTTTAGCTCAAGCTAATAAAATAC 

AACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACA 
AGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGA.CGCCCAGGCCTCTAAAAC r f 1 TTAGCTCAAGCTT 
ATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAACTCACAAATACCCAAGAAGA 
TCACTATAAACTTGGAATCATTAGATTCAAGCTTAAAAAGTATGAACACTCAATAGAATCATTTGATCAAACAATA 
AAACTCGACCCAAAACATAAAAAAGCACTTCATAACAAAGGAATAGCTTTAATGATGCTAAATAAAAACAAAAA^G 
CAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTACTACCAAAAAGGAATAGCAGA 
AGAJVAAAAATGCCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATCTCGACAAAAACCCCAATTAT 
GCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGAGTA'TT^AAATTTT^TT^G 
CCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAATTTGAAAACAATAAACTTGAAGA 
ATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATCCAGAAAAAAGTGAATATTTATA" p TTAAAAGCi^'T'CTATA. 
AATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTTAGTAATTGAAAAAAACCCTGAAA &CEACTT 
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CAGCCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGiW\TAAAAGTCAAGCAATCTCAACTCTTGAAAAGAT 
AATAAACAAAAATAATAAATTAGCCTTAAACAATCTTGGGATACTTTACAAAAAAGAAAAAAATTATCAAAAAGCA 
ATTGAAATTTTTGAAAAAGCAATAATCAATTCAGATATTGAAGCAAAATATAATCTTGCAACCACTCTAATTGAAA 
TTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCAAACAATCCAGAGGCCTTACA 
TGCACTAGGAATAATAGAATATAATGAAAATAACAATGATCAAACACTAAGAGAACTATAAAAAAATTTCCAAATT 
ACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

f743 .aa 

MRIYLFLNKNYKIFILFLILILNSKLAYSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYPKVQLGYYFLSTAYREN 
NQLTEAEGALLDGIAVGGEIDYILYYELGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALLNRANAYVQQGKITS 
KEKEYQKAWDSYTMAIHDYSQFITLRSKTEKKDSILLIISYLRNEKIOT.EQLDKSLKGRTEHIVYAKEDKNQILKD 
SFKDNLETNSLIELEKLNWQEELYIDE 

t743 .aa 

YSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYPKVQLGYYFLSIAYRENNQLTEAEGALLDGIAVGGEIDYILYYE 
LGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALL^JRANAYVQQGKITSKEKEYQKAWDSYTMAIHDYSQFITLRS 
KTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIVYAKEDKNQILKDSFKDNLETNSLIELEKLNWQEELYIDE 

f743 .nt 

ATGAGGATTTATTTATTTTTAAATAAAAATTACAAGATTTTTATTTTATTTTTAATTTTAATAT^AAAT^CAAAAT 
TGGC 

ATATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTA 
AGTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATC 
AACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATCGACTACATACTATATTATGA 
ATTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAA 
AGTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAG 
AAAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATC 
AAAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATTTAAGAAATGAAAAAATTAATCTTGAACAACTT 
GACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAAAAATCAAATACTTAAAGATAGTT 
TTAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTATACA^AGATGA 
ATAA 

t743.nt 

TATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTAA 
GTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATCZ\ 
ACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATCGACTACATACTATATTATGAA 
TTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAAA 
GTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAGA 
AAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATCA 
AAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATTTAAGAAATGAAAAAATTAATCTTGAACAACTTG 

acaaaagtttgaaggggcgaaccgagcatattgtatacgcaaaagaagataaaaatcaaatacttaaagatagttt 

TAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTAT^CATAGATGAA 
TAA 

f748.aa 

MKFIINLLLSTIKIITFTVIVCLTILSIFQPIYILKENEISITTRLGKIQRTENLAGLKYKIPLIENVQIFPKIIL 
RTOGEPQRTPTGGEEKQLIWIDTTARWKIADINKFYTTIKTMSRAWRIDAAIEPAVRGVI-AKYPLLEIIRSSNDP 
IQRLSNGILTPQETKINGrYKITKGRKIlEKEIIRIANNNTKDIGIETVDVLIRKVTYDPSLIESVT^NRMISERQQ 
IAEEQRSIGLAEKTEILGSIEKEKLKILSEAKATAAKIKAEGDREAAKIYSNAYGKNIEFYKFWQALESYKAVLKD 
KRKIFSTDMDFFQYLHKRN 



TABLE 1 . Nucleotide and Amino Acid Sequences 



IFQPIYILKHNEISZTTRLGKIQRTZNLAGLKYKZPIalS^/QIFPKIILRWDGEPQRIPTGGEEKQLIWIDTTARW 
KIADINK7YTTIXZ3*S?JVZVR^ 

IISKEIZRZAN^TZTUDZGIZZVZTLZRJT/^YDPSL^ 
LSEAKATAAKZKAZGDPZ^J^ZYSNAZ/GJZ^ 



ATGAAATTTATAAZ-AAATCT^TZATC'ATCTACTATAAAGATTATAACCTTTACAGTAATAGTTTGCTTGACTATTT 
TGTCTATTCTCCAGCCAATTTAZATZTTGAAAGAA^ 

aactgaaaatttagctggacttaaacataaaataccattaattgaaaatgtgcaaatatttcccaaaatcattctt 

agatgc<;-azggagaacctcaaz\gaazccc^^^^ 

gajtggaaaattgcagacataaataaat^^ 

agcaattgaacczgctgttagg&gggttattgcaaaataccctttgcttgaaattataagaagctcaaacgatcct 
attcaacgtttgzctaaz'ggajitactcaccccacaa 

gaaagataatcgaaaaagaaata\tzcgzatagcaaacaacaataccaaagatattggaattgaaattgtagacgt 
actaataagaaaigttacttatgacccaagccttattgaatctgtaaacaacagaatgatctcagaaagacaacaa 
atcgcagaagaacaaagaagcazaggatcagctgaa^ 

AAATATZAAGTGAAGCA>J^GCCACZGCZGCAAAA^ 

AAATGCATA.TGGCAAAAATATTGAAZTTZACAjyiTTCTGGCAGGCATTAGAAAGCTATAAAGCAGTATTAAAAGA.T 
AAAAGAAAJ^TT^ZCTCAACAGACArGGATTTCTTTCAATATCTTCACAAAAGAAATTGA 

t748 .nt 

ATTTTCCAGCCAAZZTAZATTT^GAAAGAAAATGA^^ 

AAAATTTAGCTGGACTTAAATAZAAAATACCATTAATTGAAAATGTGCAAATATTTCCCAAAATCATTCTTAGATG 
GGATGGAGAACCTCAAAGAA"T'CCCAACAGGA-GGGGAAGAAAAGCAATTAATATGGATTGATACAACTGCTAGATGG 
AAAATTGCAGACArAAAZAAATZTTACACAACAATAAAAACAATGAGTAGAGCTTACGTTAGAATTGATGCAGCAA 
TTGAACCTGCTGTCAGGGGGGTCATTGCAAAATACCCTTTGCTTGAAATTATAAGAAGCTCAAACGATCCTATTCA 
ACGTTTGTCTAA~<^AATACTCACCCCACAJiGAAACAAAAATTAACGGTATTTATAAAATAACAAAAGGACGAAAG 
ATAATCGAAAAAQAZAATAAZTCGZAZAGCAAJVCAACAJVTACCAJ^GATA.TTGGAATTGAAATTGTAGACGTACTAA- 
TAAGAAAAGTTACZCATGACCCAiGCCTCAT^GAATCTGTAAACAACAGAATGATCTCAGAAAGACAACAAATCGC 
AGAAGAACAAAGAAGCATAGGATZAGCTGAAAAAACAGAAATTCTTGGAAGCATAGAAAAAGAAAAACTGAAAATA 
TTAAGTGAAGCAAAAGCCACZGCTGCAAAAATAAAAGCCGAAGGGGATAGA 

CATATGGCAAAAACATTGAATTT^A-CAAJ^TTCTGGCAGGCATTAGAAAGCTATAAAGCAGTATTAAAAGATAAAAG 
AAAAATTTZ'CTCAACAGACATGGA'TZ'TCTTTCAATATCTTCACAAAAGAAATTGA 

f7S4.aa 

MSGPKKIAIIAL1V~ SIQGCXESSIZEKQFire^ 
ILLTHSSNNEII^^ILXEXIFYLSXIX^ 

TLINEKNISYIQZ'FITSQIKTIIZFSLRDNl'JIILKKILNSPFSKNIKFVLIGNTRKDLXIIKLKYIITLKEPDLIK 
I AXDV3KDFQ YE FNZ YKQ 

f7S4.aa 

ekqfitcaiifsdazeyffszqtzpfzknzilfzn^ 
ikfslxxsj.dfll^xszdlqktllfrd:<su 

lrd^zilkkiz^s?fs:<nzkf-/z.ignt?z<dlkiiklkyiitlkepdlikiakdvekdfqyefniykq 

f7S4.r.c 

ATGTCZGGCCCZAAAAA\CZTGCTAZAAZAGCGCTCTTAGTAATTTCAATACAAGGATGCAAAGAATCTTCTATTA 
TTGAAAAi.CAAZTCAATZ'AZGCAATAATZTTTTCAGATGCAACTGAATATTTTTTTGAAATTCAAACAACTCCATT 
CATAAAA\ACGAA-.TACZATTTAZAAATGACAAA^\TTTAGAAATTATAAAAGACAAGCTTAA 
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ATACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAJ^GAGAAAATTTTTTATCTATCAA 
AAATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACA^TixCT 
ATTTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAAAATGTC^ATATT 
ACTCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTTCTCAAATAAAAACAATAATATTATTCT 
CTTTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAAT 
TGGCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGCCTGATTTGATAAAA 
ATAGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 

t764.nt 

GAAAAACAATTTAATTATGCAATAATTTTTTCAGATGCAACTGAATATTTTTTTGAAATTCAAACAACTCCATTCA 
TAAAAAACGAAATACTATTTATAAATGACAAAAATTTAGAAATTATAAAAGACAAGCTTAAAACAACAAAAAAAAT 
ACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAAGAGAAAATTTTT^TCT'ATCAAAA 
ATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACATTSCTAT 
TTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAAAATGTCAATATTAC 
TCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTTCTCAAATAAAAACAATAATATTATTCTCT 
TTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAATTG 
GCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGCCTGATTTGATAAAAAT 
AGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 

f770.aa 

MINFSKSFFYPLPIGKIFVLSGDMGSGKTSFLKGLALNLGISYFTSPTYNIVNVYDFINFKFYHIDLYRVSSLEEF 
ELVGGLEILMDLDSIIAIEWPQIALSIVPKDRLFSLTFKIVGSGRWELNG 

t770 .aa 

KTSFL,KGLALNLGISYFTSPTYNIVNVYDFINFKFYHIDLYRVSSLEEFELVGGLEILMDLDSIIAIEWPQIALSI 
VPKDRLFSLTFKIVGSGRWELNG 

£770. nt 

ATGATAAATTTTTCCAAATCTTTTTTTTATCCTTTGCCAATTGGTAAAATATTTGTTTTAAGTGGTGACATGGGAT 

CTGGAAAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACAT 

TGTTAATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGATTTATATCGGGTGTCTTCTTTGGAAGAATTT 

GAGCTTGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACAAAT^GCTTTGA 

GCATTGTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGG 
TTAA 

t770.nt 

AAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACATTGTTA 
ATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGATTTATATCGGGTGTCTTCTTTGGAAGAATTTGAGCT 
TGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACAAATTGCTTTGAGCATT 
GTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGGTTAA 

f7 90.aa 

MNTKATTPLLLLFLIQSLAFSSEIFEFKYIKGSKFRLEGTDNQKIYFNGHY^TSSSNTNIQISSEIKDIKENF?SIK 

AFFRILKRENINEPYLLNEEFEEIFSVNKQGEYTIGANQKRPSVRGIPRFPKTPIKINEKWSYLAEEYIEASKIDK 

SIKDFV^F^yWYEYKGKEEHNGKHYHIILSNYESQYNVKNISFYQKVDQKIYFDNEIGNTYKYSDKYIFSTNQNN 

NQriFKi^IGNSLGRIVSIELPNDNLIETEVENYIREKKIKAIEVEKlNTOKGINLSFDIEFYPETSFOILQKEYKKTDLI 

AKLLSKFKICNNILxEGHTEQFGLEEEMHELSEKRARAIGNYLIKMKVKDKDQILFKGWGSQKPKYPKSSPLKAKNR 
RVEITILNN 



t7.9 0 .aa 
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SEIFEFKYIKGSKFRLEGTDNQKIYFNGHYNSSSNTNIQISSEIKDIKENFASIKAFFRILKRENINEPYLLNEEF 
EEIFSWKQGEYTIGANQKRPSVRGIPRFPKTPIKINEKWSYLAEEYIEASKIDKSIKDFWKFNVNYEYXGKSEH 
NGKHYHIILSNYESQYWKNISFYQKVDQKIYFDNEIGNTYKYSDKYIFEINQM^QHFKMIGNSLGRIVSIELPN 
DNLIETEVENYIREKKIKAIEVEKNNKGINLSFDIEFYPNSFQILQKEYKKIDLIAKLLEKFKKNNILIEGHTEQF 
GLEEEMHELSEKFJVRAIGNYLIKMKVKDKDQILFKGWGSQKPKYPKSSPLKAKNRRVEITILNN 

£790. nt 

ATGAATACCAAGGCGACTACACCATTGTTGTTATTATTTTTAATTCAAAGCTTAGCTTTTTCTTCTGAAATCTTTG 
AATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATTTCAATGGCCATTA 
TAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTTTGCAAGCATTAAA 
GCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTTGAAGAAATCTTCA 
GCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTATTCCAAGATTCCC 
AAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTCAAAAATAGACAAA 
AGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCACAATGGCAAGCATT 
ACCACATAATTCTTTCGAATTATGAATCACAATACAATGTAAAAAACATCTCTTTCTATCAAAAAGTAGACCAAAA 
AATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAATAAATCAGAACAAC 
AACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAATAGTTTCAATTGAGCTTCCAAATGATAATCTTATTG 
AAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTGAAAAAAACAATAAAGGTATTAA 
TTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAAAATTGACCTTATA 
GCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACTAATAGAAGGACATACTGAGCAATTTGGATTGGAAGAAG 
AGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGAAAGTAAAAGACAAAGA 
CCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAAGGCTAAAAATAGG 
CGAGTAGAAATTACAATATTAAATAACTAA 

t790.nt 

TCTGAAATCTTTGAATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATT 
TCAATGGCCATTATAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTT 
TGCAAGCATTAAAGCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTT 
GAAGAAATCTTCAGCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTA 
TTCCAAGATTCCCAAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTC 
AAAAATAGACAAAAGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCAC 
AATGGCAAGCATTACCACATAATTCTTTCGAATTATGAATCACAATACAATGTAAAAAACATCTCTTTCTATCAAA 
AAGTAGACCAAAAAATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAAT 
AAATCAGAACAACAACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAATAGTTTCAATTGAGCTTCCAAAT 
GATAATCTTATTGAAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTGAAAAAAACA 
ATAAAGGTATTAATTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAA 
AATTGACCTTATAGCTAAACTTCTTGAAAAATTTAAAAAAAATAAGATACTAATAGAAGGACATACTGAGCAATTT 
GGATTGGAAGAAGAGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGAAAG 
TAAAAGACAAAGACCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAA 
GGCTAAAAATAGGCGAGTAGAAATTACAATATTAAATAACTAA 

f792.aa 

MKIFIYWWIFFFSVFKVFSIYSLTDEEFFKKYSLFFVHKGFLSKNVNGKITKVQVTMGINSRWVYPFYKLVPSRIT 
SIYEDVYSSSSFLTTSNNLYVSYDYSKNFRKLVGXDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISF 
KN-LNRLIPQIYLGAGYYDIISAIEFSKEETNNLYFSSGVYGDIFLISQKSGFIKKI5FPFKKOIIRILDLSSKNVE 
KILWTYDNHFYSYINGQWFIGKLSLQDQDFFEKSQRMQLA20JIKGSIYLTAYTLRNKKAVDERFKFIKDSGMNAV 
VIDFKDDNGNLTYSSKLSLPNKLRSVKNFIDVPYILKKAKELGIY^/IARCWFKDSKLYYYDWFKHALWNKKTNKP 
WAHLIKKVDSSGLVKWQVEHWDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSDGPVSLAISRMNKYEMQP 
VDALESFLIMAREQLYVPISVDIYGYNGWFPTNSIGQNISMLSDYVDVISPMFYPSHYTDDFLPSNFYYTKRAYRI 
YKEGSDRALAFSLDGWIRPYVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLWNASNVYYMIKGSLKEY 
LDSF 
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t792 .aa 



IYSLTDEEFFKKYSLFFVHKGFLSK^^VNGKITKVQVNGINSRWVyPFYKLVPSRITSIYEDVYSSSS^ ::, LTTSNNLY 
VSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISFKNLNRLIPQIYLGAGYYDII 
SAIEFSKEETNNLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKNVEKILVRTYDNHFYSYINGQWV 
FIGKLSLQDQDFFEKSQRMQLAKNKGSIYLTAYTLRNKKAVDERFKFIKDSGMNAWIDFKDDNGNLTYSSKLSLP 
NKLRSVKNFIDVPYILKKAKELGIYVIARCWFKDSKLYYYDNFKHALWNKKTNKPWAHLIKKVDSSGLVKYVQVE 
HWVDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSDGPVSLAISRMNKYEMQPVDALESFLIMAREQLYVPIS 
VDIYGYNGWFPTNSIGQNISMLSDYVDVISPMFYPSHYTDDFLPSNFYYTKRAYRIYKEGSDRALAFSLDGWIRP 
' YVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLWNASNVYYMIKGSLKEYLDSF 

f792 .nt 

ATGAAAATTTTTATCTATTGGGTAGTTATTTTCTTCTTTTCTGTTTTCAAGGTTTTTAGTATATA'PTCATTAACCG 
ATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAAATGTTAATGGGAAAAT 
AACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGTTCCTAGTCGAATTACT 
TCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAACAATCTTTATGTTTCTTATGATTATT 
CAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGGTGCATATATTACATCTAGTGCCTTTTCTCA 
AGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAATGGAGC^ATTAGTTTT 
AAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATTAGTGCTATTGAATTTT 
CAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTTAATTAGTCAGAAAAGTGG 
ATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATCTAGTAAGAATGTAGAA 
AAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGGTATTTATTGGAAAATTAT 
CTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTGCTAAAAATAAAGGGTCTATTTATTTAAC 
AGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTATTAAAGATTCAGGTATGAATGCTGT'T 
GTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCAAGCTTTCTTTGCCCAATAAGTTGAGATCTG 
TTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTTGGAATTTATGTTATTGCTAGATGTGT 
T-GTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACACGCCCTTTGGAATAAAAAAACCAATAAACCT 
TGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTCTTGTGAAATATGTACAAGTAGAGCATTGGGTAGATATTT 
TTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTGGAGTTGACGAGATACAATT 
TGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAAGTATGAGATGCAACCC 
GTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCTGTTGATATTTATGGGT 
ACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATGTTGACGTCATATCTCC 
TATGTTTTATCCTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAAAAGAGCTTATAGGATT 
TATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCTTATGTTCAAGCTTTTT 
TATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTATTTAAAGTTTCAGCTTAAAGGAATTAAAGA 
GTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGGTAGTTTAAAAGAATAT 
TTAGATTCTTTTTA 

t792.nt 

ATATATTCATTAACCGATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAA 
ATGTTAATGGGAAAATAACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGT 
TCCTAGTCGAATTACTTCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAACAATCTTTAT 
GTTTCTTATGATTATTCAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGGTGCATATATTACAT 
CTAGTGCCTTTTCTCAAGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAA 
TGGAGCTATTAGTTTTAAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATT 
AGTGCTATTGAATTTTCAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTTAA 
TTAGTCAGAAAAGTGGATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATC 
TAGTAAGAATGTAGAAAAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGG^A 
TTTATTGGAAAATTATCTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTGCTAAAAATAAAG 
GGTCTATTTATTT AAC AGCATAT ACATTGCGTAAT AAGAAGGCAGTTGATGAAAGATTTAAAT r f 1 T .A TT AAAG A.TTC 
AGGTATGAATGCTGTTGTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCA^GCTTTCT m TGCCC 
AATAAGTTGAGATCTGTTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTTGGAATTT.A^ G 
TTATTGCTAGATGTGTTGTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACACGCCCTTTGGAATAA 
AAAAACCAATAAACCTTGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTCTTGTGAAATATGTACAAGTAGAG 
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CATTGGGTAGATATTTTTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTGGAG 
TTGACGAGATACAATTTGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAA 
GTATGAGATGCAACCCGTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCT 
GTTGATATTTATGGGTACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATG 
TTGACGTCATATCTCCTATGTTTTATCCTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAA 
AAGAGCTTATAGGATTTATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCT 
TATGTTCAAGCTTTTTTATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTATTTAAAGTTTCAGC 
TTAAAGGAATTAAAGAGTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGG 
TAGTTTAAAAGAATATTTAGATTCTTTTTAA 

f797 .aa 

MSIKKFILTLIILSLAKNSFSENEINIFSNENYIVK2NIKTEIKKLKQSFLLASVDVAISQPYIELADLNGEPIKE 
LEGISYSFINVFSKIGSSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTL 
RTESLSKTIAEYYKDNNWYYILAAITVENNINKETEKYEIRINPKIYNDFQKKLRLKFKSNQIKKFPIPIIE 

t797.aa 

KNSFSENSINIFENENYIVKENIKTEIKKLKQSFLLASVDVAISQPYIELADLWGEPIKELEGISYSFINVFSKIG 
SSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTLRTESLSKTIAEYYKDN 
NWYYILAAITVENNINKETEKYEIRINPKIYNDFQKKLRLHFKSNQIKKFPIPIIE 

f797 .nt 

ATGAGCATTAAAAAATTTATTTTAACCTTGATAATTCTTTCTCTAGCTAAAAATAGCTTTTCTGAAAACGAAATTA 
ATATCTTCGAAAACGAAAATTATATTGTAAAAGAAAATATAAAAACAGAAATTAAAAAACTAAAACAAAGTTTTTT 
ACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGCAGATTTAAATGGAGAACCGATAAAAGAA 
CTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAAAATTGGATCTTCTGCTATTATTTCATTTGACCTAT 
CAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAATTTTTAAGTCCAGATAAAGGCAATTTTATTAACCA 
GCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGAGCTTGCAAAAGACGCTTACTCATTTGGTACATTA 
AGAACTGAATCTCTTTCAAAAACAATTGCAGAATATTACAAAGATAACAACTGGTATTATATTTTAGCAGCAATAA 
CAGTAGAAAATAATATAAATAAAGAAACTGAAAAATACGAAATTAGAATTAACCCTAAAATATATAATGATTTTCA 
AAAAAAATTGAGATTACATTTTAAAAGCAACCAAATAAAAAAATTTCCAATACCCATTATAGAATAA 

t797.nt 

AAAAATAGCTTTTCTGAAAACGAAATTAATATCTTCGAAAACGAAAATTATATTGTAAAAGAAAATATAAAAACAG 
AAATTAAAAAACTAAAACAAAGTTTTTTACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGC 
AGATTTAAATGGAGAACCGATAAAAGAACTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAAAATTGGA 
TCTTCTGCTATTATTTCATTTGACCTATCAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAATTTTTAA 
GTCCAGATAAAGGCAATTTTATTAACCAGCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGAGCTTGC 
AAAAGACGCTTACTCATTTGGTACATTAAGAACTGAATCTCTTTCAAAAACAATTGCAGAATATTACAAAGATAAC 
AACTGGTATTATATTTTAGCAGCAATAACAGTAGAAAATAATATAAATAAAGAAACTGAAAAATACGAAATTAGAA 
TTAACCCTAAAATATATAATGATTTTCAAAAAAAATTGAGATTACATTTTAAAAGCAACCAAATAAAAAAATTTCC 
AATACCCATTATAGAATAA 

f799.aa 

MKKHIIIGIIFVAILLFFKILLIPRIQNHEOTSIKNNIKMIISYKQDKNFiSLKINIKTKKTTNLGKii 
IESNLLYISSKNFTTYANIIYQNESLLSIILKSNGNW/FYSKRIKPRGXI 

t799 .aa 

HENNKKNIKMIISYKQDKNRLSLKIN^ 
I ILKSNGNNNVFYSKRIKPRGKI 




• 



133 

TABLE 1 . Nucleotide and Amino Acid Sequences 

f799 .nt 

ATGAAAAAACATATCATTATTGGGATAArCTr^GTTGCAATTCTTC'TArC'rTTTAAAATTTTATTAATTCCCAGAA 

TTCAAAATCACGAAAArAATAAAAACAArArCAAAACGATAArAAGCTACAAGCAAGACAAA 

AAAGATAAACATAAAAACAAAAAAAACCACCAACCTGGGAAAr-GCC^^ 

ATTGAAAGCAATTTGCtTTTATACAJVGCAGCAAAAAC 

TATTAAGTATAATATTAAAGAGCAACGGCAAr^ 

ATGA 



CACGAAAATAATAAAAATAATATCAAAACGA^ 
ACATAAAAACAAAAAAAACTACCAACCTGGGAA-^GCC^^ 

CAATTTGCTTTATATA-.GCAGCAAAAACrTTACAACArATGCTAA~ArAATCTATCAAAATGAAAGTTTATTAAGT 
ATAATATTAAAGAGTAATGGCAATAATAATG-CTTT^ATAGTAAAAGAACAAAACCTAGAGGTAAAATAT 



MKKHYKALILSLLFAIZ5COTKTLNZLG3ZQ7 KZPFGTLPGAZMPl^ 

FNSYGKLIQTYQNGrFZTNPDLKIKKIDFEGIQAIYPLKDFIZVADKI^TKXSKFNQKENIAYFMRILILNKNSSV 

EILGQEGLNGMPFPQrYDVW/DENGN-AZISIYSEGYII^ 

FFEVNKKTLYVKTTYY3^IGDNHNIN3IlGi:<i:<^^ 

DKIIASTNMKNLSN^riCKLDSKGSII^Zq 



KTLNE LGEEQFK I PFGCL PGAIMPLl'^IKB'TNSKFXlIKrYNGLVYI AEIKTJJKLMI FNSYGKLI QTYQNGI FKTNPD 
LKIKKIDFEGIQAIYPIXDFIIVADXI^CJ^ 
DENGNIAIISIYSEGYZIYSYNXZFS?LYKIYV3TFCNLLKTTD^^ 
NENINDLGIKIKDQYIY3CMSLKXNKZLZVINKIAJ1PCT 

SKGSIKEQIALIEP?NIJIFL5ESLSKDGZLSILYGGXTGVSVr;r^TL>IALLKL 



atgaaaaaacactataaagctcttatayzaac-cytgctttttc<:aattatatcatgtaatactaaaactttaaacg 
aattaggagaagaacaatttaaaataccatttggaacacttcctggtgcaataatgcctctgaataacaaatttac 
aaattcaaaatttgacatcaaaacgta?aj\cgggctagtc 
ttcaactcatacggaaaactaatacaaacatatcaaa^ 

aaatagattttgaaggaj^ttcaagcaatatacccactaaaagattttattattgtcgcagacaaactaaataataa 
aaaatcaaaattcaaccaaaaagagaatattgcctacttcatgagaatactaatactaaacaaaaactcatctgta 
gaaattttgggtca^gajlggtttaaacggaargccatttccacaaatttatgatgttaatgttgatgaaaatggca 

ACATTGCAATAATATCAATATATAGCGAAGGATATATAATATAT'TCTTACAATAAAGAATTTTCCCCGCTTTATAA 
AATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATCAAAA.GAAAAAATACAACATTTCAATAGATAAGGTT 
TTTTTTGAAGTCAACAAAAA\ACTCTT^ATG~AAAA-.CTACTTACTATGAAAACATTGGTGACAATGAAAATATAA 
ACGATCTTGGAATTAJIAATTAAAGATCAATATATCTATAAAA'TGAGTTTGAAAAAAAA 

AAATAAAATTGCTCTTCCTAAAAACTTACTAGATGACAAACAAGAAAGCTTTATAAACATTATAAAAATACAAAAA 
GACAAAATAATAGCATCTACTAATATGAAAAAT^TATCTAACAATTTAATATGGAAATTAGACAGCAAGGGCTCAA 
TTAAAGAACAAATAGCTTTAATC'GAGCCTCCAAATTTAATGTCTCTCTCTGAGAGTTTATCTAAAGATGGAATACT 
TAGTATACTTTATGGCGGAAAAACTGGTGTTAGCGTCTACTGGTGGAATTTAAATGCATTATTAAAATTATAA 



AAAACTTTAAACGAATCAGGAGAAGAACLAATTTAAAATACCATTT<KAACACTTCCTGGTGCAATAATGCCTCTGA 
ATAACAAATTTACAAAZ'TCAAAATTCGACATCAAAACGTATAACGGGCTAGTGTACATTGCAGAAATAAAAACAAA 
TAAATTAATGATTTTCAAC^CAGACGGAAAACGAATACAAACArATCAAAATGGAATATTTAAAACAAACCCCGAT 
TTAAAAATAAAAAAAArAGATTTTGAAGGAArTCAAGCAATATACCCACTAAAAGATTTTATTATTGTCGCAGACA 



t799 . 



f800 .aa 



t8Q0 .aa 



f800.nt 



t80G .nt 
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AA.CTAAATAATAAAAAATCAAAATTCAACCAAAAAGAGAATATTGCCTACTTCATGAGAATACTAATACTAAACAA 
AAACTCATCTGTAGAAATTTTGGGTCAAGAAGGTTTAAACGGAATGCCATTTCCACAAATTTATGATGTTAATGTT 
GATGAAAATGGCAACATTGCAATAATATCAATATATAGCGAAGGATATATAATATATTCTTACAATAAAGAATTTT 
CCCCGCTTTATAAAATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATCAAAAGAAAAAATACAACATTTC 
AATAGATAAGGTTTTTTTTGAAGTCAACAAAAAAACTCTTTATGTAAAAACTACTTACTATGAAAACATTGGTC-AC 
AATGAAAATATAAACGATCTTGGAATTAAAATTAAAGATCAATATATCTATAAAATGAGTTTGAAAAAAAACAAAG 
AATTAGAAGTGATAAATAAAATTGCTCTTCCTAAAAACTTACTAGATGATAAACAAGAAAGCTTTATAAACATTAT 
AAAAATACAAAAAGACAAAATAATAGCATCTACTAATATGAAAAATTTATCTAACAATTTAATATGGAAATTAGAC 
AGCAAGGGCTCAATTAAAGAACAAATAGCTTTAATTGAGCCTCCAAATTTAATGTTTCTCTCTGAGAGTTTATCTA 
AAGATGGAATACTTAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTTTACTGGTGGAATTTAAATGCATTATT 
AAAATTATAA 



MYKLFLFFIIFMFLSCDEKKSSKNLKSVKIGYA/NWGGETAATNVLKWFEKMGYl^IAEIFSVTTSIMYQYLASGKID 
GTVSSWVPTADKFYYEKLKTKFVDLGANYEGTIQGFWPSYVPISSISELKGKGDKFKNKMIGIDAGAGTQIVTEQ 
ALNYYGLSKEYELVPSSESVMLASLDSSIKRNEWILVPLWKPHWAFSRYDIKFLDDPDLIMGGISSVHTLVRLGLE 
NDDFDAYYVFDHFYWSDDLILPLMDKNDKEPGKEYENAVEFVEKNKEIVKTWVPEKYKTLFD 



CDEKKSSKNLKSVKIGYVNWGGETAATNVLKWFEKMGYNAE I F SVTTS IMYQYLASGKIDGT'VSSWVPTADKFYY 
EKLKTKFVDLGANYEGTIQGFVVPSYVPISSISELKGKGDKFKNKMIGIDAGAGTQIVTEQALNYYGLSKEYELVP 
SSESWLASLDSSIKRNEWILVPLWKPHWAFSRYDIKFLDDPDLIMGGIESVHTLTOLGLENDDFDAYYVFDHFYW 
SDDLILPLMDKNDKEPGKEYRNAVEFVEKNKEIVKTWVPEKYXTLFD 



ATGTATAAATTATTTTTATTTTTTATTATTTTTATGTTTTTGTCTTGTGATGAAAAAAAGAGTTCAAAGAATTTAA 
AATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAGCTACAAATGTATTAAAGGTTGTTTTTGAGAA 
AATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTATAATGTATCAATACTTAGCATCTGGAAAGATAGAC 
GGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTATGAAAAACTGAAAACAAAGTTTGTTGATCTTG 
GTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCTATGTTCCAATTTCCAGCATTAGTGAGCTTAA 
GGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGCATAGATGCTGGTGCGGGAACTCAAATTGTTACAGAACAA 
GCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCTTCAAGTGAGAGTGTTATGCTTGCAAGTTTAG 
ATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTTGTGGAAGCCTCATTGGGCTTTTTCTAGGTATGATAT 
TAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGAGCGTGCATACTCTTGTTAGACTTGGTCTTGAA 
AATGATGATTTTGATGCATATTATGTTTTTGATCATTTTTATTGGAGCGATGATTTAATATTGCCCTTAATGGATA 
AAAATGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTGAATTTGTTGAAAAGAATAAAGAGATTGTAAAGAC 
GTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 



TGTGATGAAAAAAAGAGTTCAAAGAATTTAAAATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAG 
CTACAAATGTATTAAAGGTTGTTTTTGAGAAAATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTATAAT 
GTATCAATACTTAGCATCTGGAAAGATAGACGGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTAT 
GAAAAACTGAAAACAAAGTTTGTTGATCTTGGTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCT 
ATGTTCCAATTTCCAGCATTAGTGAGCTTAAGGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGCATAGATGC 
TGGTGCGGGAACTCAAATTGTTACAGAACAAGCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCT 
TCAAGTGAGAGTGTTATGCTTGCAAGTTTAGATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTTGTGGA 
AGCCTCATTGGGCTTTTTCTAGGTATGATATTAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGAG 
CGTGCATACTCTTGTTAGACTTGGTCTTGAAAATGATGATTTTGATGCATATTATGTTTTTGATCATTTTTATTGG 
AGCGATGATTTAATATTGCCCTTAATGGATAAAAATGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTGAAT 
TTGTTGAAAAGAATAAAGAGATTGTAAAGACGTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 



f810 .aa 



t810 .aa 



f810 .nt 



t810 .nt 



f 814 .aa 



m 
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MLVKRIVGKPITMLILFSLLLMISLYTFSRLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNL 
KNIYSVSSKESSTVSLEFYHGTDLDLVLNEIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELK 
RYADEIIKPGLERLDGVAIVTVNGGSKKRVLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNLEYLVEV 
■SGKFKSIEEIGNWIAYKIPDISSGINLSPIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVS 
NVVMNEIEKLKLSMPKDMKLEIASDSTDFIKASISTVWSAYFGAMLAIFVIFFFLRSFRATIIIGISIPIAIVLT 
FCLMYFVNISLNIMSLAGLALGIGMWDCSIWIDNIYKYRQKGAKLISSSILGAQEMMLPITSSTFTSICVFGPF 
LIFKSELGVYGDFFKDFTFTIVISLGVSLLVAIFLVPVLSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFL 
YINLLNIVLNHKLIFGLIVFFSFIGSLLLGLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEA 
KGYKSIIATLRADRITFNVLFPLKEESRDNLTQSVDYDSIKYKIMNRIGWLYPEFNIEPSISGNALGGGDSIKIKI 
SANDFEYIXDYGKILVSMLKKEIPELWPRLSISDFQLQIGVEIDRALVYOTGIDMNTILNELKANINGVVAGQYV 
EKGLNYDIVLKLDRMDVKNLKDLEKIFITNSSGVKI PFS S IATFEKTNKAES IYRENQALTIYLNAGI S PDDNLTQ 
VTAKVVDFINMKVPHKEGITLKVEGEYNEFSNIMKQFKIIIMMAIIVVFGIKASQFESFLKPFIIIFTIPLTAIGV 
VL IHFLAGEKLS I F AAIGMLMLVGVWWTGIVLVDYTGLLIKRGFGLREAI I ESCRSRLRP I LMSSLT S I IGL I PM 
AFSSGSGNELLKPIAFTFIGGMTASTFLTLFFIPMLFEIFPTCFKFQI 



RLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNLKNIYSVSSKESSTVSLEFYHGTDLDLVLN 
EIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELKRYADEIIKPGLERLDGVAIVTVNGGSKKR 
VLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNIiEYLVEVSGKFKSIEEIGNWIAYKIPDISSGINLS 
PIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVSNVVMNEIEKLKLSMPKDMKLEIASDSTDF 
IKASISTVVNSAYFGAMLAIFVIFFFLRSFRATIIIGISIPIAIVLTFCLJXnTFVNISLNIM 

SIWIDNIYKYRQKGAKLISSSILGAQEMMLPITSSTFTSICVFGPFLIFKSELGVYGDFFKDFTFTIVISLGVSL 
LVAIFLVPVLSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFLYINLLNIVLNHKLIFGLIVFFSFIGSLLL 
GLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEAKGYKSIIATLRADRITFNVLFPLKEESRD 
NLTQSVDYDSIKYKIMNRIGNLYPEFNIEPSISGNALGGGDSIKIKISANDFEYIKDYGKILVSMLKKEIPELVNP 
RLSISDFQLQIGVEIDRALVYNYGIDMNTILNELKANINGVVAGQYVEKGL^ 

NS SGVKI PF S S IATFEKTNKAES IYRENQALTIYLNAGI SPDDNLTQVTAKVVDF INNKVPHKEGITLKVEGE YNE 
FSNIMNQFKIIIMMAI IVVFGIiXIASQFESFLKPFI IIFTIPLTAIGVVLIHFLAGEKLSIFAAIGMLMLVGVVVNT 
GIVLVDYTGLLIKRGFGLREAIIESCRSRLRPILMSSLTSIIGLIPMAFSSGSGNELLKPIAFTFIGGMTASTFLT 
LFFIPMLFEIFPTCFKFQI 



ATGTTGGTAAAGAGAATAGTTGGCAAACCAATAACAATGTTGATTTTATTTTCA'PTGTTATTGATGATAAGTTTGT 
ATACCTTTTCAAGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCC 
TGGCGCTTCTCCTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTA 
AAAAATATATATAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATGATGGAACCGATTTAGATT 
TGGTTTTAAATGAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAAT 
TTTTAGATACAATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAA 
AGATATGCCGATGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAA 
GTAAAAAGCGTGTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTTGTCAAGAATATCTTC 
AATTATAGCATCCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTTGAAGTT 
TCTGGAAAATTTAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCA 
TAAATTTATCTCCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATA 
TGTTGAATATAATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCT 
AATGTTGTTATGAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATA 
GTACTGATTTTATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGT 
TATTTTTTTCTTTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGACC 
TTTTGTTTAATGTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTTGGGATTGGAATGG 
TTGTTGACTGTTCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATTTCGTCTTC 
TATTCTCGGAGCTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTT 
CTTATTTTCAAATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTTATTTCCTTGG 
GTGTTTCTCTTTTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATGTCGGTTTATACACAAGTTTCCA 
AAAGAATATTAAGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATTATTTTTTAGAGTTTTTG 



t814.aa 



f 814 .nt 



* # 
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TATATCAATTTATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTTTTATTGGCA 
GCTTGCTTTTAGGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAA 
TTTTCCCCACAAAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGGCT 
AAAGGATATAAAAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCAACGTATTGTTTCCTCTCAAAGAAG 
AATCAAGAGATAATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCT 
TTATCCTGAATTTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAATTAAAATT 
TCGGCCAATGATTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATGTTAAAAAAGGAAATTCCCGAAC 
TTGTAAATCCAAGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGATAGACAGAGCGCTAGTTTATAA 
TTATGGTATTGACATGAATACCATTTTAAATGAGTTGAAGGCCAATATTAATGGTGTTGTTGCTGGGCAATATGTG 
GAGAAGGGACTTAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTTAAAAGATTTAGAAAAAA 
TATTTATTACAAATTCATCTGGAGTTAAAATTCCTTTTTCATCAATAGCCACCTTTGAAAAAACCAATAAAGCCGA 
ATCTATTTACAGAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTCCAGATGATAATTTAACCCAA 
GTAACCGCAAAAGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTCTTAAGGTTGAAGGAG 
.AATATAATGAATTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTAT 
TATGGCTTCTCAATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTT 
GTGCTTATACATTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGTTGGTGTTG 
TGGTAAATACAGGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAAT 
TATTGAATCTTGTCGTTCAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATG 
GCATTTTCTAGCGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGAATGACAGCTAGCA 
CATTTCTTACTTTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

t814 .nt 

AGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCCTGGCGCTTCTC 
CTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTAAAAAATATATA 
TAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATCATGGAACCGATTTAGATTTGGTTTTAAAT 
GAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAATTTTTAGATACA 
ATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAAAGATATGCCGA 
TGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAAGTAAAAAGCGT 
GTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTTGTCAAGAATATCTTCAATTATAGCAT 
CCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTTGAAGTTTCTGGAAAATT 
TAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCATAAATTTATCT 
■CCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATATGTTGAATATA 
ATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCTAATGTTGTTAT 
GAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATAGTACTGATTTT 
ATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGTTATTTTTTTCT 
TTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGACCTTTTGTTTAAT 
GTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTTGGGATTGGAATGGTTGTTGACTGT 
TCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATTTCGTCTTCTATTCTCGGAG 
CTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTTCTTATTTTCAA 
ATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTTATTTCCTTGGGTGTTTCTCTT 
TTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATGTCGGTTTATACACAAGTTTCCAAAAGAATATTA 
AGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATTATTTTTTAGAGTTTTTGTATATCAATTT 
ATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTTTTATTGGCAGCTTGCTTTTA 
GGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAATTTTCCCCACA 
AAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGGCTAAAGGATATAA 
AAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCAACGTATTGTTTCCTCTCAAAGAAGAATCAAGAGAT 
AATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCTTTATCCTGAA'T 
.TTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAATTAAAATTTCGGCCAATGA 
TTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATGTTAAAAAAGGAAATTCCCGAACTTGTAAATCCA 
AGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGATAGACAGAGCGCTAGTTTATAATTATGGTATTG 
ACATGAATACCATTTTAAATGAGTTGAAGGCCAATATTAATGGTGTTGTTGCTGGGCAATATGTGGAGAAGGGACT 
TAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTTAAAAGATTTAGAAAAAATATTTATTACA 
AATTCATCTGGAGTTAAAATTCCTTTTTCATCAATAGCCACCTTTGAAAAAACCAATAAAGCCGAATCTATTTACA 
GAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTCCAGATGATAATTTAACCCAAGTAACCGCAAA 
AGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTCTTAAGGTTGAAGGAGAATATAATGAA 
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TTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTATTATGGCTTCTC 
AATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTTGTGCTTATACA 
TTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGTTGGTGTTGTGGTAAATACA 
GGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAATTATTGAATCTT 
GTCGTTCAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATGGCATTTTCTAG 
CGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGAATGACAGCTAGCACATTTCTTACT 
TTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

f818.aa 

MLKmSKLIIQLKVVMMIYLKKMGNDMTKFYNYRIEIVSNLSLELDWECIE 
GEKHGNGVWPEENFILIIYTSNQSIVERLKDIVDDLNRSYPTEGINLFVLRNS 

t818 .aa 

KKMGNDMTKFYNYRIEIVSNLSLELDVFECIEKIEQELGESIYYSKIGNVYGKGKKGEKHGNGVWPEENFILIIYT 
□ SNQS I VERLKD I VDDLNRS YPTEGI NLF VLRNS 

|| f818.nt 

ATGTTGAAAAATCATTCAAAATTAATAATTCAACTAAAAGTAGTTATGATGATTTATTTGAAGAAGATGGGGAATG 
f|j ATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTGATGTTTTTGAATGTAT 
;y AGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTATGGAAAAGGTAAGAAG 
Q GGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACCTCCAATCAGTCTATTG 

TTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTCTTACCCTACAGAAGGGATTAATCTTTTTGTTTTGAG 

AAATTCTTAA 

•'; t818.nt 

^ AAGAAGATGGGGAATGATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTG 
ATGTTTTTGAATGTATAGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTA 
TGGAAAAGGTAAGAAGGGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACC 
TCCAATCAGTCTATTGTTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTCTTACCCTACAGAAGGGATTA 
ATCTTTTTGTTTTGAGAAATTCTTAA 

f820.aa 

MLNNTYRIKTILTIFLAITLLTIYKYFTLMAFNNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQK 
IENIVSTSETLGAILQINSRILKEKLSSNKGFLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRNTTS 
NITGFVGTDNLGLEGIEFSLNSILGKDKTKQQFLNEEPETNNIHLTIDMDIQQGVSKIAKKYFKENNPESLITLVM 
NSQNGEILSMVQFPQYDANFYSKYPEEIRKNLSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPS 
GEKITIKTLNPPYKHIDSTEILIYSSNVGIAYITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLLNHYSKWSGR 
SKATIGFGQEIGVSAVQILQAASILSTOTGIMLXPRIIKKISNDKGENIKSFDKEEIRKVISKNSAQKVLKMMREW 
NKGGIPNLKIKNLDISAKSGTSQAIDRKTGKYSEEDYTSSILAIYPTEQPKYIIYIVYRYPKKIIYGTRIAAPMAK 
EIIEFIEHQQNTIAYKKIKMPSKIKIPKAETNYKNKTYLPNFINLSKREAIDILKYYKNTMKIKINGDGFVYKQSI 
SPNTKLEDITELELYLK 

t820 .aa 

FMNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQKIENIVSTSETLGAILQINSRILKEKLSSNKG 
FLYJ.KRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRNTTSNITGFVGTDNLGLEGIEFSLNSILGKDKTKQ 
QFLNEEPETNNIHLTIDMDIQQGVSKIAKKYFKENNPESLITLVMNSQNGEILS1WQFPQYDANFYS.KYPEEIRKN 
LSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPSGEKITIKTLNPPYKHIDSTEILIYSSNVGIA 
YITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLLNHYSKWSGRSKATIGFGQEIGVSAVQILQAASILSNNGIM 
LKPRI IKKISNDKGENIKEFDKEEIRKVTSKNSAQKVLKMMREVVNKGGIPNLKIKNLDISAKSGTSQAIDRKTGK 
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YSEEDYTSSILAIYPTEQPKYIIYIWRYPKKIIYGTRIAAPMAKEIIEFIEHQQNTIAYKKIKMPSKIKIPKAET 
- NYKNKT YL PNF INL SKREAI D I LKY YKNTMK I KINGDGF VYKQS I S PNTKL EDIT EL E L Y LK 

f820.nt 

ATGCTTAATAACACTTATCGAATAAAAACAATATTAACAATATTCTTGGCTATAACTTTGTTAACTATTTACAAAT 
ATTTCACACTAATGGCCTTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGG 
AACAATTTATGATAGAAATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAA 
ATAGAAAATATTGTAAGCACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGC 
TTTCCTCTAACAAAGGGTTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAAGAATTCA 
AGCTGAAGGCAGGCTTTCAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGC 
AATATTACTGGTTTTGTAGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAA 
AAGATAAAACCAAGCAACAATTTTTAAATGAGGAGCCAGAAACAAACAACATCCACTTAACAATAGACATGGATAT 
ACAACAAGGTGTTAGCAAAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAATTACCTTGGTAATG 
AACTCCCAAAATGGAGAAATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTG 
AAGAAATCCGAAAAAACCTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATAAAATTTTTACAGTTGCAAT 
AATATTAGAAAGTGGAAAATTAAATTTAGAAGAAAAATTTTTAGACAATGGAATATATCAAAAACAATTTCCATCA 
GGAGAAAAAATTACAATCAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCAT 
CAAATGTTGGAATAGCTTACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTT 
TGGGGAAAAAGTTGGAGTTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAATGGTCAGGACGA 
AGTAAAGCTACAATTGGATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAA 
GCAATAATGGAATAATGCTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAAAGGAGAAAATATTAAAGAATT 
. TGATAAAGAAGAAATAAGAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGATGAGAGAAGTTGTA 
AATAAAGGTGGAATTCCAAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACATCTCAAGCTATTG 
ATAGAAAAACGGGAAAATACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACCAAA 
ATATATTATTTACATTGTATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAGCCCCAATGGCAAAA 
GAAATAATAGAATTTATTGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATGCCATCAAAAATCAAGA 
TCCCTAAAGCTGAAACTAATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCTAAAAGAGAAGCAAT 
AGACATACTAAAATACTATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATA 
TCCCCCAATACAAAATTAGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

t8-2 0 .nt 

TTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGGAACAATTTATGATAGAA 
ATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAAATAGAAAATATTGTAAG 
CACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGCTTTCCTCTAACAAAGGG 
TTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAAGAATTCAAGCTGAAGGCAGGCTTT 
CAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGCAATATTACTGGTTTTGT 
AGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAAAAGATAAAACCAAGCAA 
CAATTTTTAAATGAGGAGCCAGAAACAAACAACATCCACTTAACAATAGACATGGATATACAACAAGGTGTTAGCA 
AAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAATTACCTTGGTAATGAACTCCCAAAATGGAGA 
-AATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTGAAGAAATCCGAAAAAAC 
CTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATAAAATTTTTACAGTTGCAATAATATTAGAAAGTGGAA 
AATTAAATTTAGAAGAAAAATTTTTAGACAATGGAATATATCAAAAACAATTTCCATCAGGAGAAAAAATTACAAT 
CAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCATCAAATGTTGGAATAGCT 
TACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTTTGGGGAAAAAGTTGGAG 
TTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAATGGTCAGGACGAAGTAAAGCTACAATTGG 
ATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAAGCAATAATGGAATAATG 
C TAAAAC CTAGAATAATAAAAAAAAT AAGCAACGAT AAAGGAGAAAATAT T AAAGAATTTGATAAAGAAGAAATAA 
GAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGATGAGAGAAGTTGTAAATAAAGGTGGAATTCC 
AAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACATCTCAAGCTATTGATAGAAAAACGGGAAAA 
TACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACCAAAATATATTATTTACATTG 
TATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAGCCCCAATGGCAAAAGAAATAATAGAATTTAT 
TGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATGCCATCAAAAATCAAGATCCCTAAAGCTGAAACT 
AATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCTAAAAGAGAAGCAATAGACATACTAAAATAC? 



TABLE 1. Nucleotide and Amino Acid Sequences 

ATAAAAATACTATC-AAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATATCCCCCAATACAAAATT 
AGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 



MAKNNLLVFFIAIIFVFVSIIWFYNSLGKDYVKSGGEIVENLEKDLNDYLKENDAKEREKIFLRIRELI^KEKEI 
SSYFISRFYLARAVYFQSQAQYDEAIKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLVYEDLINSTSLDFL 
KVRALLSKJVILIEEKDKELAVKVYEEIVKFPYEJJNLYINMANNKILELKQN 

tS31.aa 

YNSLGKDYVKSGGEIVENLEKDLNDYLKENDAKEREKIFLRIRELISKEKEISSYFISRFYLARAVYFQSQAQYDE 
AIKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLv-YEDLINSTSLDFLKVRALLSKAILIEEKDKELAVKVY 
EEIVKF PYENNLYINKANNKI LELKQN 

f831 .nt 

ATGGCTAAAAATAATCTTTTAGTTTTCTTTATTGCTATTATTTTTGTGTTTGTGTCTATTATTGT^GT^TTTTATA 
ATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATTAT-T 
AAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGAAATT 
TCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGT^TGATGAGGCTA 
TTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAGTTTA 
TGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTT^GGATTTTT^A 
AAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATACGAAG 
AGATTGTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAACTTAAGCAAAA 



TATAATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATT 

ATTTAAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGA 

AATTTCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGTATGATGAG 

GCTATTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAG 

TTTATGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTTTGGAT^T 

TTTAAAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATAC 

GAAGAGATTGTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAAC^TAAGC 
AAAATTAA 



MKAIGNAILLNMPLIFSIGISIGVARMGQGTAALGGLIGYLTFNITENYFIEAFSGLVEAETMSSVGRINFFGVQT 
LJJTGIAGSLAVGLLVGYLHNKFYNMKLPKPFVFFSECHFVPIVIILPFCVFLAIFFCLIWSSFDDLIASLGLFVFR 
FEYFGSFLYGFLNRLLLPLGLHSILSFPFEFTSLGGVEIVNGDTVRGLKNIFYAQLLDPSLGKFSSGFAKISSGFY 
LSIMFGLPGAALGVYKGIVHEDKNKVAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFA^iAAYSGFAELLANFFN 
VTIGNSFSTGFLDFFMFGILQGNSKTNWISVLPLGAMFFALYYFTFSWLYRYFDFQIFVTDDPFFEGQEGKLESLG 
IAHLLIQGLGGFDNITKLDVCSTRLff/DV^/NTELVDNNLLKEAGVLKIGLVNGKVQLFYGSNVYYIKNAIDTYSPK 
SLFEASVMVAVDNVKKGFKTYIEMKEDKKLEKQGKSGKTYKLSELEED 

t843 .aa 

RMGQGTAALGGLIGYLTFNITENYFIEAFSGLVEAETMSSVGRINFFGVQTLNTGIAGSLAVGLLVGYLHNKFYNM 
KLPKPFVFFSECHFVPIVIILPFCVFLAIFFCLIWSSFDDLIASLGLFVFRFEYFGSFLYGFLNPLLLPLGr.HSIL 
SFPFEFTSLGGVEIWGDTVRGLKNIFYAQLLDPSLGKFSSGFAKISSGFYLSIMFGLPGAALGVYKGIVFEDK21K 
VAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFVHAAYSGF.ALLLANFFNVTIGNSFSTGFLDF^NFQILQGNSX 
TNWISVLPLGAMFFALYYFTFSWLYRYFDFQIFVTDDPFFEGQEGKLESLGIAHLLIQGLGGFDNI^KLDVCSTRL 



# • 
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HVDVWTELVDNNLLKEAGVLKIGLW^ 
EDKKLEKQGKSGKTYKLSELEED 

f 843 .nt 

ATGAAGGCTATAGGCAATGCTATTCTTCTCAA.TATGCCTTTAATTTTTTCTATTGGAATTTCTATTGGAGTTGCAA 
GAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTAT 
TGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAACT 
TTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATGA 
AGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTTT 
TTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAGG 
TTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGCATTCTATTTTAT 
CTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATAT 
ATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGATTTTAT 
CTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAGG 
TTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTAT 
TTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAAT 
GTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGATACTTCAAGGAAATTCTAAGA 
CAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTGGCTTTATAG 
ATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGGA 
ATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTGATGTTTGTTCTACAAGATTGC 
ATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTGT 
TAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAAG 
AGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAAG 
AAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 

t843 .nt 

AGAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTA 
TTGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAAC 
TTTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATG 
AAGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTT 
TTTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAG 
GTTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGCATTCTATTTTA 
TCTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATA 
TATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGATTTTA 
TCTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAG 
GTTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTA 
TTTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAA 
TGTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGATACTTCAAGGAAATTCTAAG 
■ACAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTGGCTTTATA 
GATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGG 
AATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTGATGTTTGTTCTACAAGATTG 
CATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTG 
TTAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAA 
GAGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAA 
GAAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 



MRFKKIFLIIFIISWLKVYSYNyAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEA 
HMLTYRGYKDSPKSLISRTDLIEIGFMYYFPILLLINGKlsIFGEIDLGIGVKNLLFGDWGGHLMQSIIHLILNQHRP 
IPSIKSYDSYNYRGFLSFALJT/SYMNFLNLENYMDLSYFADYFIKNSIGITLK1TENIGFDIKLYSQIQNQISSLKT 
YSKTQEAETGIGINYQFYSK2SrFFITM^LNIKNFSTKENFLSVGGFGIIITPEEYKKISESNNEFNVISN^IFYFGFD 
IMIPLKIRNSLFYKINFJ^IIfflYFSISTNYYTNYNETNSFTNQLSSGIMYEFLPQKTFNPYLISGLFFAYWQNNKDI 
KSISRPIRIKNILQVGIENELGFLFKMLKYPJNTEYIFKIYBiCVNYIPIAYNLDEKKLEKHSINFNYLGIGIVVK 
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t850 . aa 

YSYNYAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEAHMLTYRGYKDSPKSLISR 
TDLIEIGFMYYFPILLLINGKNFGEIDLGIGVKNLLFGDWGGHLMQSIIHLILNQHRPIPSIKSYDSYNYRGFLSF 
ALNYSYMNFLNLENYMDLSYFADYFIKNSIGITLKNEKIGFDIKLYSQIQNQIKSLKTYSKTQEAETGIGINYQFY 
SKNFFITNNLNIKOTSTKENFLSVGGFGIO^ 

INHYFSISTNYYTNYNETNSFTNQLSSGIMYEFLPQKTFNPYLISGLFFAYNQNWKDIKSISRPIRIKNILQVGIE 
NELGFLFKMLKYRNTEYIFKIYSKWYIPIAYNLDEKKLEKHSINFNYLGIGIWK 



ATGCGGTTTAAAAAAATATTTTTAATAATATTTATAATTTCTAATTTAAAAGTTTATTCTTATAATTATGCAATCC 
AATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATGATGGATTCGGATTTTCATTAAGCGATTT 
TTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAATTTTATAATAAATTTAGAAGCA 
CACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGAACAGACTTAATTGAAATAGGCT 
TCATGTACTATTTTCCAATTTTATTGCTAATTAATGGAAAAAATTTTGGAGAAATAGACTTGGGAATTGGAG^TAA 
AAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCTCATTTTAAATCAACACCGTCCA 
ATTCCAAGTATAAAAAGCTACGAGAGCTACAATTATAGAGGATTTTTAAGCTTTGCTCTAAATTACTCTTACATGA 
ATTTTTTAAATTTAGAAAATTATATGGACTTATCTTATTTTGCAGATTATTTTATTAAAAACAGTATTGGAATTAC 
CTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTATTCCCAAATTCAAAA'TCAAATCAAAAGCCTCAAAACA 
TATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAATTTTACTCTAAAAATTTTTTCATAACCA 
ATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAAGCGTTGGGGGATTTGGAATAATCATTACACC 
TGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAATAATTTTTACTTTGGATTTGAT 
ATTATGATCCCATTAAAAATAAGAAATTCATTATTTTATAAAATAAATGAAAACATCAACCATTACTTTTCAATAT 
CAAGAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCAATTATCATCAGGCATCATGTATGAATT 
TTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTATAATCAAAACAATAAAGATATC 
AAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAAAATGAATTAGGATTTTTGTTCA 
AAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTAACTATATTCCTATAGCTTATAA 
CTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTAGGAATTGGAATAGTCGTTAAATAA 



TATTCTTATAATTATGCAATCCAATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATGATGGAT 
TCGGATTTTCATTAAGCGATTTTTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAA 
TTTTATAATAAATTTAGAAGCACACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGA 
ACAGACTTAATTGAAATAGGCTTCATGTACTATTTTCCAATTTTATTGCTAATTAATGGAAAAAATTTTGGAGAAA 
TAGACTTGGGAATTGGAGTTAAAAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCT 
CATTTTAAATCAACACCGTCCAATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGATTTTTAAGCTTT 
GCTCTAAATTACTCTTACATGAATTTTTTAAATTTAGAAAATTATATGGACTTATCTTATTTTGCAGATTATTTTA 
TTAAAAACAGTATTGGAATTACCTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTATTCCCAAATTCAAAA 
TCAAATCAAAAGCCTCAAAACATATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAATTTTAC 
TCTAAAAATTTTTTCATAACCAATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAAGCGTTGGGG 
GATTTGGAATAATCATTACACCTGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAA 
TAATTTTTACTTTGGATTTGATATTATGATCCCATTAAAAATAAGAAATTCATTATTTTATAAAATAAATGAAAAC 
ATCAACCATTACTTTTCMTATCAACAAATTATTACACTAATTATAATGAAACTAATAGC'TTTACA.A^TCAATTAT 
CATCAGGCATCATGTATGAATTTTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTA 
TAATCAAAACAATAAAGATATCAAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAA 
AATGAATTAGGATTTTTGTTCAAAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTA 
ACTATATTCCTATAGCTTATAACTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTA.GGAAT 
TGGAATAGTCGTTAAATAA 



f850.nt 



t850 .nt 



f853 .aa 



MKSFLFWILGTVGISSFAQNTPVAIINLYKNEIITKTGFDSKVDIFKKTQGRDLTDAEKKQVLQVLIADVLFSQE 
ASKQGIKISDDEVMQTIRTQFGLVNFTDEQIXQMIEKQGTNWGELLSSMKRSLSSQKLVLKQAQPKFSEIKTPSEK 
EIVEYYEANKTKFWPDISRVSHIFFSTKDKKRSDVLDQAKNILSQIRSKKITFESATOKYSNDESSKAKNGDLGF 
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LSRGDQNAQNLLGADFVKEVFNFNKGDISSPIASKEGFHIVKVTEKYAQRFLGLM)K^/S?TA^LI"<DiI-^^-N 
VQQQQIWQVQQDMYGKLNKSANIQILDSSLK 

t853 .aa 

QWTPVAIINLYKNEIITKTGFDSKVDIFKKTQGRDLTDAEKXQVLQVLIADVLFSOEASXQGIKISDDST/i^QTI-T 
QFGLWFTDEQIKQMIEKQGTIWGELLSSMKRSLSSQKLV^^ 

RVSHIFFSTKDKKRSDVLDQAKNILSQIRSKKITFEEAVRKYSNDESSKAKNGDLGFLSP.GDQNAQNLLGADFVKE 
VFNFNKGDISSPIASKEGFHIVKVTEKYAQRFLGLNDKVSPTADLIWDAIRisnslMIWQCQQIWQVQQDI'r^GKl^I 
KSANIQILDSSLK 

f853 .nt 

ATGAAGAGTTTTTTATTTTGGGTAATATTGGGAACTGTAGGGATTAGCTCTTTTGCTCAAAATACTCCTGTTGCTA 
TTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGTTTTGATTCTAAGGTTGATATATTTAAAA\GACCCA 
AGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTCTGCAAGTTTTAATAGCAGATGTTCTTTTTAGCCAAGAG 
GCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAATTAGAACTCAAXTTGGGCTCGTGAA-T 
TTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACAAATTGGGGCGAGCTTTTGTCTTCAATGAAAiG 
ATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAAGTTTTCTGAAATTAAAACTCCTAGTGAGAAA 
GAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGATATTTCAAGAGTTAGTCACATCTT-TT 
TTTCTACTAAAGATAAAAAAAGATCAGATGTTTTAGATCAAGCAAAAAATATTTTAAGCCii A A© AGArCAAAAAA 
AATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAGGCTAAAAATGGTGA-TCCrTGGGTTT 
TTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGGAGCCGATTTTGTGAAAGAGGTTTTTAATTTCAATAAGG 
GTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAAGTTACAGAAAAATATGCTCAGAGii^T 
TTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAGATGCAATAAGAAATAACA'rGA'TTAAT 
GTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTATGGTAAGCTTAACAAGTCTGCAAArATACAAii 
TCTTGGATTCTAGTCTAAAATAA " 

t853 .nt 

CAAAATACTCCTGTTGCTATTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGTTTTGATTCTAAGGTTG 
ATATATTTAAAAAGACCCAAGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTCTGCAAGTTTTAATAGCAGA 
TGTTCTTTTTAGTCAAGAGGCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAA'TTAGAACT 
CAATTTGGGCTTGTGAATTTTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACA\ATTGGGGGGAGC 
TTTTGTCTTCAATGAAAAGATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAt.GTTTCCTGAA^T 
TAAAACTCCTAGTGAGAAAGAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGPTA.TTTCA 
AGAGTTAGTCATATCTTTTTTTCTACTAAAGATAAAAAAAGATCAGATGTTTTAGATCAAGCAAAAAACATTTTAA 
GCCAAATAAGATCAAAAAAAATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAGGCTAA 
AAATGGTGATCTTGGGTTTTTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGGAGCCC-ATTTTGCGAAAGAG 
GTTTTTAATTTTAATAAGGGTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAAGTTACAG 
AAAAATATGCTCAGAGATTTTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAC-ATGCAAT 
AAGAAATAACATGATTAATGTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTA'TGGTAAGCTTAAC 
AAGTCTGCAAATATACAAATCTTGGATTCTAGTCTAAAATAA 

f859.aa 

MKLPKLYKLILLFLFTTRLFSVKDEKSDMKLELFSWETKIKKNSKNYDSNSNSKKi:<XESILXRDTOSEKNINSN 

lYIQKSKKINYPNRNLGNNINQKTANDW^ 

I I IKMLKTKGYTL I EYI EDNN 

t859 .aa 

VKDEKSDNKLELFSNVETKIKKNSKNYDSNSNSKKIKKESI^ 
QKTA 



f 859 .nt 
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ATGAAATTACCAAAACTTTACAAATTAATACTACTCTTTCTTTTTACAACAAGATTGTTTTCAGTAAAAGATGAAA 
AATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAAAAATTCTAAAAATTACGACTCAAA 
TTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTTAAAAAGAGATACAAACAGCGAAAAAAATATAAATTCCAAT 
ATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTAGGCAATAATATCAATCAAAAAACTGCAA 
ATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACTTTCAAGAAATTAA 
AAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCGGCCCAATATTAAAAGATAATCTAGGA 
ATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAATTAA 

t859 .nt 

GTAAAAGATGAAAAATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAAAAATTCTAAAA 
ATTACGACTCAAATTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTTAAAAAGAGATACAAACAGCGAAAAAAA 
TATAAATTCCAATATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTAGGCAATAATATCAAT 
CAAAAAACTGCAAATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACT 
TTCAAGAAATTAAAAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCGGCCCAATATTAAA 
AGATAATCTAGGAATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAAT 



f 861 .aa 



MKNFKEVIIIFDSGIGGLSYFKYIKSRIGGCQYVYVADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVLAC 
NTISVSVYNKLNFVFPWYTLPDVSSVSDLVLKRVLLIATNTTLESKFVKDQVNIHNDLIVKAAGELWFVEYGEN 
YKKYALRCLEALKFEWNTGREIVFLGCTHYLHLKVMIEDFLKIPVYENRELWKNLIRSMNFSEHKGNYVKNDFD 
FVDDEFYLTENKNLTFYQNFCKKYNLRFKGMIV 

t861 .aa 

RIGGCQYVWADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVLACNTISVSVYNKLNFVFPWYTLPDVSS 

VSDLVLKRVLLIATNTTLESKFVKDQVNIHNDLIVKAAGELVNFVEYGENYKKYALRCLEALKFEWNTGREIVFL 

GCTHYLHLKVMIEDFLKIPVYENRELWKNLIRSl^FSEHKGNYYKNDFDFVDDEFYLTENKNLTFYONFCKKYNL 
RFKGMIV 

f861.nt 

ATGAAAAATTTCAAAGAAGTAATAATTATTTTTGATTCAGGAATAGGAGGGCTTTCTTATTTTAAATATATTAAAA 
GTAGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTCCCTTATGGAGAAAAAAGTCCTGAATA 
TCTTCTAGAAGCAGTTTTGTTTTTGATTGAGAAGCTTAAAAAAATCTATAATATTGGTGCATTAGTTTTGGCTTGT 
AATACAATTTCTGTTAGTGTATACAATAAATTAAATTTTGTTTTTCCAGTAGTCTATACTTTGCCAGATGTAAGTT 
CAGTTTCAGATCTTGTTTTAAAAAGAGTTCTTTTGATTGCAACAAATACTACTCTTGAAAGCAAATTTGTTAAGGA 
TCAAGTAAATATACATAATGATTTGATTGTAAAAGCTGCTGGAGAGCTTGTTAATTTTGTTGAATATGGAGAGAAT 
TACAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTGGTAGAGAAATTGTTTTTC 
TTGGATGCACGCATTATTTGCATCTTAAGGTAATGATAGAAGATTTTTTAAAAATTCCTGTTTATGAGAATCGTGA 
ATTAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACACAAAGGTAATTATTATAAGAATGATTTTGAT 
TTTGTAGATGATGAGTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAATATAATC 
TTCGCTTTAAGGGAATGATAGTTTGA 

t861.nt 



AGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTCCCTTATGGAGAAAAAAGTCCTGAATATC 
TTCTAGAAGCAGTTTTGTTTTTGATTGAGAAGCTTAAAAAAATCTATAATATTGGTGCATTAG^TTTGGCTTGTAA 
TACAATTTCTGTTAGTGTATACAATAAATTAAATTTTGTTTTTCCAGTAGTCTATACTTTGCCAG^TGTAAGTTCA 
GTTTCAGATCTTGTTTTAAAAAGAGTTCTTTTGATTGCAACAAATACTACTCTTGAAAGCAAATTTGTTAAGGAT^ 
AAGTAAATATACATAATGATTTGATTGTAAAAGCTGCTGGAGAGCTTGTTAATTTTGTTGAATATGGAGAGAATTA 
CAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTGGTAGAGAAATTGTTTTTCTT 
GGATGCACGCATTATTTGCATCTTAAGGTAATGATAGAAGATTTTTTAAAAATTCC r f , GTTTATGAGAATCG rn GAA r T 1 
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TAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACACAAAGGTAATTATTATAAGAATGATTTTGATTT 
TGTAGATGATGAGTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAATATAATCTT 
CGCTTTAAGGGAATGATAGTTTGA 

£363 .aa 

MIRLKVLILCLFGIFVLNGFADTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEEIVQNFSD 
LANIAKAGIRYGTYAQFGAKFDDFVSIGFELLFNINLLKAIKRSDGTANENFSFIMAITPRFYTKLDFFVLALAFF 
TGPKINIATSSADSVLAELGTMGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV 

t363 .aa 

DTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEEIVQNFSDLANIAKAGIRYGTYAQFGAKF 
DDFVSIGFELLFNINLLKAIKRSDGTAMENFSFIMAITPRFYTKLDFFVLALAFFTGPKINIATSSADSVLAELGT 
MGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV 

f363 .nt 

ATGATTAGGCTTAAAGTTTTAATTTTGTGTTTATTTGGGATTTTTGTGTTAAATGGTTTTGCAGATACTAATTTTG 

AATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGGCTTTAGAGATTAA 

TGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCAAAATTTTTCCGAT 

TTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTTGATGATTTTGTTT 

CTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAACTGCAAATGAAAA 

TTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGCTTTAGCGTTTTTC 

ACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACAATGGGCTGGGATA 

TTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAAACCCTAAATTTTC 

TGATTTCAAGTTTGGAATAGGTTTTGAATTTG 

GAATTGTGTAG 

t363.nt 

GATACTAATTTTGAATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGG 
CTTTAGAGATTAATGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCA 
AAATTTTTCCGATTTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTT 
GATGATTTTGTTTCTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAA 
CTGCAAATGAAAATTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGC 
TTTAGCGTTTTTCACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACA 
ATGGGCTGGGATATTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAA 
ACCCTAAATTTTCTGATTTCAAGTTTGGAATAGGTTTTGAATTTGGAATTGTGTAG 

f368.aa 

MIDLTQEKQEILIKNKFLAKVFGLMSIGLLISAVFAYATSENQTIKAIIFSNSMSFMAMILIQFGLVYAISGALNK 
ISSNTATALFLLYSALTGVTLSSIFMIYTQGSIVFTFGITAGTFLGMSVYGYTTTTDLTKMGSYLIMGLWGIIIAS 
LVNMFFRSSGLNFLISILGWIFTGLTAYDVQNISKMDKMLQDDTEIKNFjyiAWASLKLYLDFINLFLYLLRFLGQ 
RRND 

t3 68 .aa 

TSENQTIKAIIFSNSMSFKAMILIQFGLVYAISGALNKISSNTATALFLLYSALTGVTLSSIFMIYTQGSIVFTFG 
ITAGTFLGMSVTGYTTTTDLTKMGSYLIMGLWGIIIASLVl^FFRSSGLNFLISILGWIFTGLTAYDVQNISKMD 
KMLQDDTEIKNFJyiAWASLKLYLDFINLFLYLLRFLGQRRND 



f368 .nt 
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ATGATCGATTTAACACAAGAAAAA.CAAGAAATACTAATAAAAAACAAGTTTTTAGCCAAAGTTTTCGGGCTTATGT 
CAATTGGACTTTTAATCTCAGCAGTATTTGCATATGCAACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTC 
AAATTCAATGTCATTTATGGCTATGATACTTATACAATTTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAA 
ATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTACTCAGCACTAACAGGAGTAACATTATCTTCTATATTTA 
TGATTTACACACAAGGATCAATAGTATTCACATTCGGAATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATGG 
ATACACTACAACAACAGATCTAACAAAAATGGGAAGCTATTTAATAATGGGCTTATGGGGAATCATTATTGCATCT 
CTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTTCCTTATATCTATTTTGGGCGTAGTTATATTTACAGGCT 
'TAACAGCTTATGATGTTCAAAATATTTCTAAAATGGACAAAATGCTACAAGACGACACTGAAATAAAAAACAGAAT 
GGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTTTATAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCAA 
AGAAGAAACGATTAA 

t368.nt 

ACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTCAAATTCAATGTCATTTATGGCTATGATACTTATACAAT 
TTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAAATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTA 
CTCAGCACTAACAGGAGTAACATTATCTTCTATATTTATGATTTACACACAAGGATCAATAGTATTCACATTCGGA 
ATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATGGATACACTACAACAACAGATCTAACAAAAATGGGAAGCT 
ATTTAATAATGGGCTTATGGGGAATCATTATTGCATCTCTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTT 
CCTTATATCTATTTTGGGCGTAGTTATATTTACAGGCTTAACAGCTTATGATGTTCAAAATATTTCTAAAATGGAC 
AAAATGCTACAAGACGACACTGAAATAAAAAACAGAATGGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTTTA 
TAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCAAAGAAGAAACGATTAA 

f371.aa 

MKFFFLLQIALILLSNSSLLFGQSPPKEKEDSLLLYKEGKFKEAILNTLEEIRLNPSMLDARTILIWSLIAIGEYK 
RAEKEAI IGLGIKKHDIRI IQALGEAYFFQKNYDNALKYFQEYI SLDSKGARIIKVYNLIADSFYELKRYNEADFA 
YEHALRFSPNNQNLLIKLARSRINAKNKILAEEALIKILTISPNNLEAKNLLEELKKSNNKP 

t371 .aa 

EDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIA.IGEYKRAEKEAI IGLGIKKHDIRI IQALGEAYFF 
QKl^DNALKYFQEYISLDSKGARIIKVYNLIADSFYELKRYNEADFAYEHALRFSPIMQNLLIKLARSRINAKKKI 
IrAEEAL IKILTIS PNNLEAKNLLEELKKSNWKP 

f371.nt 

ATGAAATTTTTTTTTCTATTACAAATAGCTTTAATTCTACTATCCAATTCAAGCTTGTTATTTGGACAATCACCGC 
CTAAAGAAAAAGAAGACTCTCTTCTTCTATATAAAGAAGGAAAATTTAAAGAAGCTATTTTAAACACGTTAGAAGA 
AATTCGACTAAATCCTAGTAACTTAGATGCTAGGACAATATTGATATGGAGCTTAATAGCCATAGGAGAATACAAG 
AGAGCTGAAAAAGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATTATTCAAGCACTAGGAGAAG 
CTTATTTCTTTCAAAAAAATTATGACAATGCATTAAAATACTTTCAAGAATACATTAGCCTTGATTCTAAAGGAGC 
AAGAATAATAAAAGTTTATAATTTAATTGCAGATTCTTTTTATGAGCTAAAAAGATATAATGAAGCCGATTTTGCA 
TACGAACATGCATTACGTTTTTCTCCTAATAACCAAAATCTATTAATAAAATTAGCAAGATCAAGAATAAATGCAA 
AAAATAAAATATTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTCCTAATAATCTAGAGGCAAAAAATTT 
ACTAGAAGAATTAAAAAAAAGCAACAACAAACCTTGA 

t371.nt 

GAAGACTCTCTTCTTCTATATAAAGAAGGAAAATTTAAAGAAGCTATTTTAAACACGTTAGAAGAAATTCGACTAA 
ATCCTAGTAACTTAGATGCTAGGACAATATTGATATGGAGCTTAATAGCCATAGGAGAATACAAGAGAGCTGAAAA 
AGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATTATTCAAGCACTAGGAGAAGCTTATTTCTTT 
CAAAAAAATTATGACAATGCATTAAAATACTTTCAAGAATACATTAGCCTTGATTCTAAAGGAGCAAGAATAATAA 
AAGTTTATAATTTAATTGCAGATTCTTTTTATGAGCTAAAAAGATATAATGAAGCCGATTTTGCATACGAACATGC 
ATTACGTTTTTCTCCTAATAACCAAAATCTATTAATAAAATTAGCAAGATCAAGAATAAATGCAAAAAATAAAATA 
TTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTCCTAATAATCTAGAGGCAAAAAATTTACTAGAAGAAT 
TAAAAAAAAGCAACAACAAACCTTGA 
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f 502 -aa 

MKKANFLSTNFLILLLVCFVlNn/NLFSK^ 

LNESEIEDEAIYLGLTYNVKLNDFFYFKSELARSISILFFKNSNKKYKNTHSTFLSNFNIGVIKNTIYEDILRLKN 
VNTIFLADNSQELVLALKNDKVDYIYGDCKTLHYIANNFLSEDLVIFTGDVFYSIKNRVAISKNAPEIVKNLNLDL 
FSYLMKMPEELVFSFLDSNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDLWNLLSRQHIFKPIFKGFSKEDIKK 
SLDGKSVGIFGGllISNDSVLElSrWYW 

IVTOfSYGFIENSITTKYLLKLNGYNGRLKSYDSIFNKNRFLVLAIDNRIYKVIKYILNSIFDDISFESLLQIDKNW 
LDKEEINSSRINSYKIMNKVKFNIEEKIWLSKOTNK 

KVHSSLDLKKLIKSGKIDMLNTNATDSNLDNVFNIKLNSRIPLYIFSNKKRVLPSRSLEKFAILDFLYSKNLASNI 
KSKLILVSSFNEALLLLYKGKVDGIISDEYTAAAVFEELNIDDVEKIPTFRDLAFDLSLAIYNQDYILKEIIQKW 
MRSWDSQlynfLNDWKFDIYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRLSFEQKRRYSFVMSEKKIAEAA 
NAAKTIFIAWSHDIRTPINGIMAATELLDTTILTDVQKDYVRMINYSSDSLLSLIDDILYLSKIDVNELYVESQE 
IDLESEMEMVLKAFQSQCAKKNIDLFSYSKSIFNl^IKGDIVKIKQVLINLIGNAFKFTDDGVIVLNYEEVCRTRT 
DGNRVLVTVEFKVIDTGKGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTT 
FSFMLPFLLGSELKSKKLSINRFQSVNGDNKVLNVLLSQKSIKIFEHCSILLGCSSlsn/RYVASFEDAYKVFKKYPS 
YNFVYINVMSJDNIQEGIRLAITOIERLNSDVQIIFL^ 

EDLVPXDSALRIKEPIWLIASDNQVNQKVLKDILWIGINENFIDVVDDGVKALKSLKDKKYTISFIDIRMPRYD 
GFSVAKEIRKFEKAKNLKPCVLVAVTAHALQEYKDKCLASGMNDYISKPIHISSIKTILKKYLQFEVDDIGENENL 
NQLVKFPNLDVNRALKELNLSYVSYSELCRGLTOFISINIIDLEKAFDEEDLSLIKDISHSISGALSNMRSELYKD 
FQKIETSKDSISELKKjVTYSFVKDDLFQLISDIKENILFESEIVSENKLYFKNNDQFLNLLNKLLIGIKTRKPREYK 
EILESINKYVLDDNIQVLFSDLRRNLRLYRFAESSKILEEIIEMLNNKRY 

t502 .aa 

CFWWLFSKDIFKFKLVDQFFPFYYKjOTKGEYEGLIFSIL^ 

WKLNDFFYFKSELARSISILFFKNSNKKYKNTHSTFLSNFNIGVIKNTIYEDILRLK]\rVlTTIFLADNSQELVLAL 
KNDKVDYIYGDCKTLHYIAWNFLSEDLVIFTGDVFYSIKNRVAISRNAPEIVKNLNLDLFSYLMKMPEELVFSFLD 
SNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDLWNLLSRQHIFKPIFKGFSKEDIKKSLDGKSVGIFGGIISND 
SVLEJSTTOYWSKPIYPLNFKFYSKDLSNDAGPINSQFIDFNFm 

LLKLNGYNGRLKSYDSIFNKNRFLVLAIDNRIYKVIKYILNSIFDDISFESLLQIDKNWLDKEEINSSRINSYKZM 
NKVKFNIEEKIWLSKNNKLNLAVK^MYPIDWEANNYKGINQFLLDKIPJ4FSGLRFNIIKVHSSLDLKKLIKSGKI 
DMLNTNATDSNLDNVFNIKLNSRIPLYIFSNKKRVLPSRSLEKFAILDFLYSKNLASNIKSKLILVSSFNEALLLL 
YKGKVDGIISDEYTAAAVFEELNIDDVEKIPTFRDLAFDLSLAIYtJQDYILKEIIQKVVMRSNVDSQMYLNDWXFD 
IYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRLSFEQKRRYSFVMNEKKIAEAANAAKTIFIANVSHDIRT 
PINGIMAATELLDTTILTDVQKDYVRMINYSSDSLLSLIDDILYLSKIDVNELYVESQEIDLESEMEMVLKAFQSQ 
CAKKAIIDLFSYSKSIFNNYIKGDIVKIKQVLINLIGNAFKFTDDGVIVLNYEEVCRTRTDGNRVLVTVEFKVIDTG 
KGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTTFSFMLPFLLGSELKSKK 
LSINRFQSWGDNKVLWLLSQKSIKIFEHCSILLGCS 

RLA1SWIERLNSDVQIIFLFYYLDNKALKNLKYGYVKKPLMGLGICSILYKKEFNPEMDFEDLVPIDSALRIKEPIN 
VLIAEDNQVTSTQKVLKDILVVIGINENFIDVVDDGVKALKSLKDKKYTISFIDIRl'IPRYDGF SVAKEIRKFEKAKNL 
KPCVLVAVTAHALQEYKDKCLASGMNDYISKPIHISSIKTILKKYLQFEVDDIGENENLNQLVKFPNLDVNRALKE 
LNLSYVSYSELCRGLVDFISINIIDLEKAFDEEDLSLIKDISHSISGALSNMRSELYKDFQKIETSKDSISELKKM 
YSFVKDDLFQLISDIKENILFESEIVSENKLYFKNNDQFLNLLNKLLIGIKTRKPREYKEILESINKYVLDDNIQV 
LF S D LRRNLRL YRF AE SSKILEEII EMLNNKRY 

f502.nt 

ATGAAAAAAGCAAACTTTTTAAGTACTAATTTTTTAATTTTACTTTTGGTTTGCTTTGTCAACGTCAATTTATTTT 
CTAAGGATATTTTCAAGTTTAAGCTTGTAGATCAATTTTTTCCTTTTTACTACAAGAATAATAAAGGAGAATATGA 
AGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGATAATAATGCTGATATTATGGTTGAGCATATTGATAAT 
TTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACTTATAATGTAAAATTAAATGATTTTTTTT 
ATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTATTTTTTAAAAACTCTAATAAAAAATATAAAAATACCCA 
TTCAACATTTTTATCCAATTTTAATATAGGAGTTATTAAAAATACAATATATGAAGATATCTTAAGGTTAAAAAAC 
GTTAACACCATTTTTTTGGCTGATAATTCTCAAGAGTTAGTATTGGCCTTAAAAAACGATAAAGTTGATTATATAT 
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ATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAGATCTTGTGATTTTTACCGGGGATGT 
TTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGATAGTAAAGAATTTGAATTTAGATTTG 
■TTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGATAGCAATGCTAAGGGAAGTTTTGTTG 
ATGTTGGTTTATATAATGATTATCCTCCTTTAAGTTTTATTAATTCACAGGGAAAATTGTCTGGCATTTTAGTGGA 
TTTGTGGAATCTTCTCTCAAGACAACATATCTTTAAACCTATTTTTAAGGGATTTTCCAAAGAGGATATTAAGAAA 
TCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTAGCAATGATAGTGTGTTGGAAAATGTTAATTATG 
TAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTCTAAAGACCTAAGCAATGATGCTGGTCCAATAAA 
TTCTCAGTTTATTGATTTTAATTTTAATAATATTCAATTAAATAAGAATAAAGATATTGTTAATAACTTTATAGAT 
ATTGTTAATAATTCATATGGGTTTATAGAAAATTCAATAACAACAAAATATTTGTTAAAATTAAATGGATATAACG 
GTAGATTAAAATCTTACGATTCGATTTTTAATAAAAATAGGTTTTTAGTATTAGCCATTGATAATAGGATTTATAA 
GGTTATTAAATATATTCTCAATTCTATATTTGATGATATTTCATTTGAATCTTTGCTTCAAATAGATAAAAATTGG 
TTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATGAATAAGGTTAAATTTAATATAGAAG 
AAAAAATTTGGTTATCAAAAAATAATAAATTAAATCTTGCTGTTAAAAATTGGTATCCAATAGATTATGTTGAGGC 
AAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTAGAATGTTTTCAGGTTTGAGATTTAACATAATT 
AAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATCGATATGCTAAATACTAATGCAACCG 
ATTCAAATTTAGATAATGTTTTCAACATAAAATTAAATTCTCGAATTCCACTTTATATTTTTTCAAATAAGAAAAG 
GGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACTTGATTTTTTATATAGTAAAAATTTGGCTTCTAATATT 
AAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAGCGTTGCTTCTTCTTTATAAGGGAAAGGTAGATGGGATTA 
TTAGCGATGAGTATACAGCTGCTGCTGTTTTTGAGGAATTAAATATTGATGATGTTGAAAAAATTCCTACTTTTAG 
AGATTTGGCTTTTGATTTGAGTCTTGCTATTTATAATCAAGATTATATCTTGAAAGAAATTATTCAAAAAGTTGTT 
ATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGATTGGAAATTTGATATTTATTATAAATCCAGAAGTATCA 
GGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATTTTATTTTACTTTTTTAGGATTTGTAATTATATT 
TATGTTCAGATTATCATTTGAGCAGAAAAGAAGATATTCTTTTGTGATGAATGAAAAAAAGATTGCGGAAGCCGCT 
AATGCTGCTAAAACCATTTTTATAGCCAATGTCAGTCATGATATTCGTACCCCTATTAACGGAATAATGGCGGCTA 
CTGAGCTTTTGGATACAACTATTCTTACAGATGTTCAAAAAGATTATGTTAGGATGATAAATTATTCATCTGATTC 
TTTGCTTTCTTTAATTGATGATATATTGTATTTGTCTAAAATAGATGTCAATGAATTATATGTTGAGAGTCAAGAG 
ATTGATTTAGAGAGTGAAATGGAAATGGTTTTAAAAGCTTTTCAATCTCAATGTGCAAAGAAAAATATTGATTTAT 
TCTCTTATTCTAAATCTATTTTTAATAATTATATAAAGGGTGATATTGTAAAAATTAAACAAGTTTTAATTAATTT 
AATAGGAAATGCTTTTAAGTTTACAGATGATGGTGTTATTGTTTTAAATTATGAAGAAGTATGTAGAACAAGAACT 
GATGGTAATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGATACAGGCAAAGGGATTGAAAAAGAAAATTTTT 
CTAAGATATTTGAAATATTTAAACAAGAGGATGATTCTTCTTCAAGGGTTCATGAAGGTGCAGGATTGGGATTGTC 
AATATCTAGAGAGCTTATAAGACTAATGGGTGGTCTTGGTATTGCTGTTGATAGCAAGGTGGGAGAGGGTACAACT 
TTTTCATTTATGTTGCCCTTTTTATTGGGTAGTGAGCTTAAAAGTAAAAAATTGTCAATCAATAGATTTCAATCAG 
TAAATGGTGACAATAAAGTATTAAATGTGCTTTTAAGTCAAAAATCTATTAAAATTTTTGAGCACTGTTCGATTTT 
ATTGGGATGCTCTTCTAATGTGCGCTATGTAGCGTCTTTTGAGGATGCTTATAAAGTCTTCAAGAAATACCCTTCT 
TATAATTTTGTTTATATAAATGTAAATAACGATAATATTCAAGAGGGTATTCGACTTGCCAATAATATTGAAAGAC 
TAAATTCTGATGTACAAATTATTTTTTTATTTTATTATTTAGATAATAAAGCTCTAAAAAATTTAAAATATGGTTA 
TGTTAAAAAGCCTTTAATGGGGCTTGGTATATGCTCTATTCTTTATAAAAAAGAGTTTAACCCAGAAATGGATTTT 
GAGGATTTGGTTCCAATAGATAGTGCTTTAAGGATAAAAGAGCCCATTAATGTTTTAATAGCTGAAGATAATCAGG 
TAAATCAAAAAGTGTTGAAAGATATTCTTGTTGTTATAGGCATTAATGAAAATTTTATTGATGTTGTAGATGATGG 
AGTAAAGGCTTTAAAATCTTTAAAAGATAAAAAATATACTATCTCTTTTATTGATATACGAATGCCAAGATATGAT 
GGATTTTCGGTGGCTAAGGAAATTAGAAAATTTGAAAAGGCAAAGAAT'TTAAAGCCTTGTGTTTTGGTTGCTGTAA 
CAGCGCATGCTTTGCAAGAGTATAAAGACAAGTGTCTTGCAAGTGGTATGAATGATTATATCTCAAAACCAATACA 
CATAAGTTCAATTAAAACTATATTAAAAAAATACTTACAGTTTGAAGTTGATGATATTGGGGAGAATGAAAATTTG 

aatcaacttgttaagtttcctaattta.gat?gttaatagggctttaaaagaattaaatctttcatatgtatcatatt 

CTGAATTATGTAGAGGGCTTGTTGATTTTATCTCTATTAATATTATTGATTTGGAAAAAGCTTTTGATGAGGAAGA 

tttgtctttaattaaggatatatctcattcaatatctggagctctttctaatatgcgtagcgaattgtataaagat 
tttcaaaaaattgaaacaagtaaagattcaatttctgagttgaaaaaaatgtattcttttgtaaaagatgatttat 
ttcaactaataagcgacataaaggaaaatattttgtttgagtctgagattgttagtgagaacaagctatattttaa 
aaataatgatcaatttttaaaccttctcaacaaacttttaattggtattaagactagaaagccaagagaatacaaa 
•gaaattcttgagagcattaataaatatgttttagacgataatattcaggtattatttagtgatcttcgcagaaatt 
taagattatatagatttgctgagagctctaagattcttgaagagattattgaaatgcttaataataagagatatta 

G ■ 



t502 .nt 
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TGCTTTGTCAACGTCAATTTATTTTCTAAGGATATTTTCAAGTTTAAGCTTGTAGATCAATTTTTTCCTTTTTACT 
ACAAGAATAATAAAGGAGAATATGAAGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGATAATAATGCTGA 
TATTATGGTTGAGCATATTGATAATTTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACTTAT 
AATGTAAAATTAAATGATTTTTTTTATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTATTTTTTAAAAACT 
CTAATAAAAAA'TATAAAAATACCCATTCAACATTTTTATCCAATTTTAATATAGGAGTTATTAAAAATACAATATA 
TGAAGATATCTTAAGGTTAAAAAACGTTAACACCATTTTTTTGGCTGATAATTCTCAAGAGTTAGTATTGGCCTTA 
AAAAACGATAAAGTTGATTATATATATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAG 
ATCTTGTGATTTTTACCGGGGATGTTTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGAT 
AGTAAAGAATTTGAATTTAGATTTGTTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGAT 
AGCAATGCTAAGGGAAGTTTTGTTGATGTTGGTTTATATAATGATTATCCTCCTTTAAGTTTTAT'TAATTCACAGG 
GAAAATTGTCTGGCATTTTAGTGGATTTGTGGAATCTTCTCTCAAGACAACATATCTTTAAACCTATTTTTAAGGG 
ATTTTCCAAAGAGGATATTAAGAAATCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTAGCAATGAT 
AGTGTGTTGGAAAATGTTAATTATGTAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTCTAAAGACC 
TAAGCAATGATGCTGGTCCAATAAATTCTCAGTTTATTGATTTTAATTTTAATAATATTCAATTAAATAAGAATAA 
AGATATTGTTAATAACTTTATAGATATTGTTAATAATTCATATGGGTTTA.TAGAAAATTCAATAACAACAAAATAT 
TTGTTAAAATTAAATGGATATAACGGTAGATTAAAATCTTACGATTCGATTTTTAATAAAAATAGGTTTTTAGTAT 
TAGCCATTGATAATAGGATTTATAAGGTTATTAAATATATTCTCAATTCTATATTTGATGATATTTCATTTGAATC 
TTTGCTTCAAATAGATAAAAATTGGTTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATG 
AATAAGGTTAAATTTAATATAGAAGAAAAAATTTGGTTATCAAAAAATAATAAATTAJ^TCTTGCTGTTAAAAATT 
GGTATCCAATAGATTATGTTGAGGCAAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTAGAATGTT 
TTCAGGTTTGAGATTTAACATAATTAAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATC 
GATATGCTAAATACTAATGCAACCGATTCAAATTTAGATAATGTTTTCAACATAAAATTAAATTCTCGAATTCCAC 
TTTATATTTTTTCAAATAAGAAAAGGGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACTTGATTTTTTATA 
TAGTAAAAATTTGGCTTCTAATATTAAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAGCGTTGCTTCTTCTT 
TATAAGGGAAAGGTAGATGGGATTATTAGCGATGAGTATACAGCTGCTGCTGTTTTTGAGGAATTAAATATTGATG 
ATGTTGAAAAAATTCCTACTTTTAGAGATTTGGCTTTTGATTTGAGTCTTGCTATTTATAATCAAGATTATATCTT 
GAAAGAAATTATTCAAAAAGTTGTTATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGATTGGAAATTTGAT 
ATTTATTATAAATCCAGAAGTATCAGGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATTTTATTTTA 
CTTTTTTAGGATTTGTAATTATATTTATGTTCAGATTATCATTTGAGCAGAAAAGAAGATATTCTTTTGTGATGAA 
TGAAAAAAAGATTGCGGAAGCCGCTAATGCTGCTAAAACCATTTTTATAGCCAATGTCAGTCATGATATTCGTACC 
CCTATTAACGGAATAATGGCGGCTACTGAGCTTTTGGATACAACTATTCTTACAGATGTTCAAAAAGATTATGTTA 
GGATGATAAATTATTCATCTGATTCTTTGCTTTCTTTAATTGATGATATATTGTATTTGTCTAAAATAGATGTCAA 
TGAATTATATGTTGAGAGTCAAGAGATTGATTTAGAGAGTGAAATGGAAATGGTTTTAAAAGCTTTTCAATCTCAA 
TGTGCAAAGAAAAATATTGATTTATTCTCTTATTCTAAATCTATTTTTAATAATTATATAAAGGGTGATATTGTAA 
AAATTAAACAAGTTTTAATTAATTTAATAGGAAATGCTTTTAAGTTTACAGATGATGGTGTTATTGTTTTAAATTA 
TGAAGAAGTATGTAGAACAAGAACTGATGGT.AATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGATACAGGC 
AAAGGGATTGAAAAAGAAAATTTTTCTAAGATATTTGAAATATTTAAACAAGAGGATGATTCTTCTTCAAGGGTTC 
ATGAAGGTGCAGGATTGGGATTGTCAATATCTAGAGAGCTTATAAGACTAATGGGTGGTCTTGGTATTGCTGTTGA 
TAGCAAGGTGGGAGAGGGTACAACTTTTTCATTTATGTTGCCCTTTTTATTGGGTAGTGAGCTTAAAAGTAAAAAA 
TTGTCAATCAATAGATTTCAATCAGTAAATGGTGACAATAAAGTATTAAATGTGCTTTTAAGTCAAAAATCTATTA 
AAATTTTTGAGCACTGTTCGATTTTATTGGGATGCTCTTCTAATGTGCGCTATGTAGCGTCTTTTGAGGATGCTTA 
TAAAGTCTTCAAGAAATACCCTTCTTATAATTTTGTTTATATAAATGTAAATAACGATAATATTCAAGAGGGTATT 
CGACTTGCCAATAATATTGAAAGACTAAATTCTGATGTACAAATTATTTTTTTATTTTATTATTTAGATAATAAAG 
CTCTAAAAAATTTAAAATATGGTTATGTTAAAAAGCCTTTAATGGGGCTTGGTATATGCTCTATTCTTTATAAAAA 
AG AGTTT AAC CC AGAAATGGATTTTGAGGATTTGGTTC C AAT AGAT AGTGC TTTAAGGATAAAAGAGCCC ATTAAT 
GTTTTAATAGCTGAAGATAATCAGGTAAATCAAAAAGTGTTGAAAGATATTCTTGTTGTTATAGGCATTAATGAAA 
ATTTTATTGATGTTGTAGATGATGGAGTAAAGGCTTTAAAATCTTTAAAAGATAAAAAATATACTATCTCTTTTAT 
TGATATACGAATGCCAAGATATGATGGATTTTCGGTGGCTAAGGAAATTAGAAAATTTGAAAAGGCAAAGAATTTA 
AAGCCTTGTGTTTTGGTTGCTGTAACAGCGCATGCTTTGCAAGAGTATAAAGACAAGTGTCTTGCAAGTGGTATGA 
ATGATTATATCTCAAAACCAATACACATAAGTTCAATTAAAACTATATTAAAAAAATACTTACAGTTTGAAGTTGA 
TGATATTGGGGAGAATGAAAATTTGAATCAACTTGTTAAGTTTCCTAATTTAGATGTTAATAGGGCTTTAAAAGAA 
TTAAATCTTTCATATGTATCATATTCTGAATTATGTAGAGGGCTTGTTGATTTTATCTCTATTAATATTATTGATT 
TGGAAAAAGCTTTTGATGAGGAAGATTTGTCTTTAATTAAGGATATATCTCATTCAATATCTGGAGCTCTTTCTAA 
TATGCGTAGCGAJ\TTGTATAAAGATTTTCAAAAAATTGAAACAAGTAAAGATTCAATTTCTGAGTTGAAAAAAATG 
TATTCTTTTGTAAAAGATGATTTATTTCAACTAATAAGCGACATAAAGGAAAATATTTTGTTTGA.GTCTGAGATTG 
TTAGTGAGAACAAGCTATATTTTAAAAATAATGATCAATTTTTAAACCT'TCTCAACAAACTTTTAATTGGTATTAA 
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GACTAGAAAGCCAAGAGAATACAAAGAAATTCTTGAGAGCATTAATAAATATGTTTTAGACGATAATATTCAGGTA 
TTATTTAGTGATCTTCGCAGAAATTTAAGATTATATAGATTTGCTGAGAGCTCTAAGATTCTTGAAGAGATTATTG 
AAATGCTTAATAATAAGAGATATTAG 

f527 .aa 

MNLLVKIAKFILILFLFTSCNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVNIYIANKNYEDALEIVNNGIIDDE 
SREYYPLYLYLMGWIYDSMGEDFVAFNIYKRWDNFDDYWENHSMKTRVAKKIVNLNIDSIDKINYYKFILNMGI 
DNLNNEEKGNYFYNLALSLEDVQDYDESYFYYKKFLSIPRAHLKIDSRDYFNWTKINYFNNREFWYRNLGDLIQ 
DVKNFVLSGNTSKLLNIRDKNNFFIQSWDQKGGKSNSINTNSFLTTMIRLGGRRKNGIQFAKHLEADSSDDISYLE 
S RGWDH I HEWYF VF KR I VY P KD P E INNGWT W I GVYLGKK 

t527 .m 

CNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVWIYIANKNYEDALSIVTOJGIIDDESREYYPLYLYLMGNIYDSM 
GEDFVAFNIYKRV^/DNFDDYVYENHSMKTRVAKKIWLNIDSIDKINYYKFILNMGIDNLNNEEKGNYFYMLALSL 
EDVQDYDESYFYYKKFLSIPPJVHLKIDSRDYFlSrWTXINYFMSIPEFVVYRlTLGDLIQDVKNFVLSGNTSKLLNIRD 
KNWFFIQSWDQKGGKSNSINTNSFLTTMIRLGGRRKNGIQFAKHLEADSSDDISYIiESRGWDHIHEWYFVFKRIVY 
PKDPEIjNTNGWTWIGVYLGKK 

f527 .nt 

ATGAATCTATTGGTCAAAATTGCTAAATTTATTTTGATTTTGTTTTTATTTACTTCTTGCAACCAAAAGCAAAGCG 
AGATTCAAAATCTTACACATCTTTTAAAATCTTCTAATAAAAATAGATTAGATAAATTTCTTATTATTGATAGAGT 
TGTTAACATATATATTGCAAATAAAAATTATGAAGATGCTTTAGAAATTGTAAATAATGGAATTATTGATGATGAA 
TCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAATATTTATGATTCCATGGGAGAAGATTTTGTAGCTT 
TTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATTATGTTTATGAAAACCATTCAATGAAAACAAGGGTTGC 
TAAAAAGATTGTCAATTTAAATATTGATTCAATCGATAAAATCAATTATTACAAATTTATATTAAATATGGGGATT 
GATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGCGCTAAGTTTGGAAGATGTTCAAGATTACG 
ATGAATCTTATTTTTATTATAAAAAATTTCTTTCAATTCCAAGGGCACATTTAAAAATAGATTCTAGAGACTATTT 
TAATGTTGTTACAAAAATTAATTACTTTAATAATCCAGAGTTTGTTGTTTATAGAAATTTAGGAGATTTAATCCAG 
GATGTTAAAAATTTTGTTCTTTCTGGTAATACTTCTAAATTGCTTAATATAAGAGATAAGAATAATTTTTTTATTC 
AAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAATAGCTTTTTAACCACTATGATTAGGCTTGG 
GGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTTGAGGCAGATTCTAGTGACGATATATCTTATCTTGAG 
TCAAGGGGCTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGAATTGTTTATCCTAAAGATCCAGAAATTA 
ATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

t527 .nt 

TGCAACCAAAAGCAAAGCGAGATTCAAAATCTTACACATCTTTTAAAATCTTCTAATAAAAATAGATTAGATAAAT 
TTCTTATTATTGATAGAGTTGTTAACATATATATTGCAAATAAAAATTATGAAGATGCTTTAGAAATTGTAAATAA 
TGGAATTATTGATGATGAATCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAATATTTATGATTCCATG 
GGAGAAGATTTTGTAGCTTTTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATTATGTTTATGAAAACCATT 
CAATGAAAACAAGGGTTGCTAAAAAGATTGTCAATTTAAATATTGATTCAATCGATAAAATCAATTATTACAAATT 
TATATTAAATATGGGGATTGATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGCGCTAAGTTTG 
GAAGATGTTCAAGATTACGATGAATCTTATTTTTATTATAAA.AAATTTCTTTCAATTCCAAGGGCACATTTAAAAA 
TAGATTCTAGAGACTATTTTAATGTTGTTACAAAAATTAATTACTTTAATAATCCAGAGTTTGTTGTTTATAGAAA 
TTTAGGAGATTTAATCCAGGATGTTAAAAATTTTGTTCTTTCTGGTAATACTTCTAAATTGCTTAATATAAGAGAT 
AAGAATAATTTTTTTATTCAAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAATAGCTTTTTAA 
CCACTATGATTAGGCTTGGGGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTTGAGGCAGATTCTAGTGA 
GGATATATCTTATCTTGAGTCAAGGGGCTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGAATTGTTTAT? 
CCTAAAGATCCAGAAATTAATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

f 541 .aa 



MNKILLLILLESIVFLSCSGKGSLGSEIPKVSLIIDGTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSD 
LEGLKDAGSDL I WL I GYRFS DVAKVAALQNPDMKYAI ID P IYSND P I P ANLVGMTFRAQEGAFLTGYI AAKLSKTG 
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KIGFLGGIEGEIVDAFRYGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEV 
AKELGSGHYIIGVDEDQAYLAPD1WITSTTKDVGRALNIFTSNHLKTNTFEGGKLINYGLKEGWGFVRNPKMISF 
EL3KEIDNLSSKIINKEIIVPSNKESYEKFLKEFI 

t541 .aa 

CSGKGSLGSEIPKVSLIIDGTFDDKSFNSSALNGVKKVKEEFKIELVLKESSSNSYLSDLEGLKDAGSDLIWLIGY 

RFSDVAKVAALQNPDMKYAIIDPIYSNDPIPANLVGMTFRAQEGAFLTGYIAAKLSKTGKIGFLGGIEGEIVDAFR 

YGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 

AYLAPDNVITSTTKDVGRALWIFTSNHLKTNTFEGGKLINYGLKEGVVGFVRNPKMISFELEKEIDNLSSKIINKE 

1 1 VP SNKE S YEKFLKE 

FI 

f541.nt 

ATGAATAAAATATTGTTGTTGATTTTGCTTGAGAGTATTGTTTTTTTATCTTGTAGTGGTAAAGGTAGTCTTGGGA 
GCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAATCTTTTAATGAGAGTGCTTTAAATGG 
CGTAAAAAAAGTTAAAGAAGAATTTAAAATTGAGCTTGTTTTAAAAGAATCCTCATCAAATTCTTATTTATCTGAT 
CTTGAAGGGCTTAAGGATGCGGGCTCAGATTTAATTTGGCTTATTGGGTATAGATTTAGCGATGTGGCCAAGGTTG 
CGGCTCTTCAAAATCCCGATATGAAATATGCAATTATTGATCCTATTTATTCTAACGATCCTATTCCTGCAAATTT 
GGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATATTGCTGCAAAACTTTCTAAAACAGGT 
AAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGTAGATGCTTTTAGGTATGGGTATGAAGCTGGTGCTAAGT 
ATGCTAATAAAGATATAAAGATATCTACTCAGTATATTGGTAGTTTTGCTGACCTTGAAGCTGGTAGAAGCGTTGC 
AACTAGGATGTATTCTGATGAGATAGACATTATTCATCATGCTGCAGGCCTTGGAGGAATTGGGGCTATTGAGGTT 
GCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTGATGAAGATCAAGCATATCTTGCTCCTGACAATGTAA 
TAACATCTACAACTAAAGATGTTGGTAGAGCTTTAAATATTTTTACATCTAACCATTTAAAAACTAATACTTTCGA 
AGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGGAGTTGTGGGGTTTGTAAGAAATCCTAAAATGATTTCCTTT 
GAAC TTGAAAAAGAAATTGAC AATCTTTC TAGCAAAATAATCAACAAAGAAATT ATTGTTC C ATCTAATAAAGAAA 
GTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

t541 .nt 

TGTAGTGGTAAAGGTAGTCTTGGGAGCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAAT 

cttttaatgagagtgctttaaatggcgtaaaaaaagttaaagaagaatttaaaattgagcttgttttaaaagaatc 
ctcatcaaattcttatttatctgatcttgaagggcttaaggatgcgggctcagatttaatttggcttattgggtat 
agatttagcgatgtggccaaggttgcggctcttcaaaatcccgatatgaaatatgcaattattgatcctatttatt 

CTAACGATCCTATTCCTGCAAATTTGGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATAT 

tgctgcaaaactttctaaaacaggtaaaattggatttttagggggaatagaaggcgagatagtagatgcttttagg 
tatgggtatgaagctggtgctaagtatgctaataaagatataaagatatctactcagtatattggtagttttgctg 
accttgaagctggtagaagcgttgcaactaggatgtattctgatgagatagacattattcatcatgctgcaggcct 

TGGAGGAATTGGGGCTATTGAGGTTGCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTGATGAAGATCAA 

gcatatcttgctcctgacaatgtaataacatctacaactaaagatgttggtagagctttaaatatttttacatcta 
accatttaaaaactaatactttcgaaggtggcaaattaataaattatggccttaaagaaggagttgtggggtttgt 

AAGAAATCCTAAAATGATTTCCTTTGAACTTGAAAAAGAAATTGACAA'TCTTTCTAGCAAAATAATCAACAAAGAA 

attattgttccatctaataaagaaagttatgagaagtttcttaaagaatttatttaa 

f561.aa 

MYKNGFFKNYLSLFLIFLVIACTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGWHSKKDRKSLGKNI 
KVFYFSEEDGHFQTIPSKENAKLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNEHGAKSDLIGTFKD 
LNIKNSKLEITVDENNSDAKTFLESVNYIIDGVEKISPMLTN 



t5 61 .aa 
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CTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNIKVFYFSEEDGHFQTIPSKENA 
KLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKLEITVDENNSDAKT 
FLESVNYIIDGVEKISPMLTN 

f561 .nt 

ATGTATAAAAATGGTTTTTTTAAAAACTATTTGTCATTGTTTTTAATTTTTTTAGTAATTGCTTGTACTTCAAAAG 
ATAGCTCAAATGAATATGTTGAGGAGCAAGAAGCGGAGAACTCTTCTAAGCCTGATGATTCTAAAATAGATGAACA 
TACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTTGGGGAAAAATATA. 
AAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCAAAGTTAATAGTTT 
ATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTATTTTTGTTGC-GAT 
TACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGGTACTTTTAAAGAT 
CTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATAATTCAGATGCCAAGACCTTCCTTGAAT^TG 
TTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

t561 .nt 



TGTAC TTC AAAAGAT AGC TC AAATGAATATGTTGAGGAGCAAGAAGC GGAGAACTCTTCT AAGC CTGATGATTCTA 
AAATAGATGAACATACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTT 
GGGGAAAAATATAAAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCA 
AAGTTAATAGTTTATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTA 
TTTTTGTTGGGATTACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGG 
TACTTTTAAAGATCTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATAATTCAGATGCCAAGACC 
TTCCTTGAATCTGTTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

f 604 .aa 



MSFNKTKKIGKKIKIVTLLMLAVSLIACNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLNT 
KTGKLKPGLAKNWEASKDKKTYQFYLRDNLFWSDGVEITAEGIRKSFLRILNKETGSTNVDMLKSIIKNGQEYFDG 
KVSDSELGIKAIDSKTLEITLTAPKPYFLELLLHYAFMPVPIHVIEKYKGNWTSPENMVTSGPFKLKKRLPNEKII 
FEKNERYYNAKE VELDELVY ITSDNDLTVYNMYKNNE IDAI FNS I PPD I WE I KLQKDYYQHKSNAIYLYS FNTKI 
KPLDDARVREALTLAIDRETLTYKVLNDGTVPTREITPDLKNYNYGKKLALFDPEKSKKLLADAGYPNGKGFPMLT 
LKYNTNETHKKIAAFIQNQWKKILNINLMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFTRENSQLA 
SYGYSWLEFDKLIRESDLEKDPIKRKQLLRKAESIIIEKDFPAAPIYIYSGHYLFRNDKWTGWNPNVSEVYYLSEL 
KPIKNAKHN 

t604 .aa 



CNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLNTKTGKLKPGLAKNWEASKDKKTYQ^YLR 
DNLFWSDGVEITAEGIRKSFLRILNKETGSTN^/DMLKSIIKNGQEYFDGKVSDSELGIXAIDSKTLEITLTAPKPY 
FLELLLHYAFMPVPIHVIEKYKGNVJTSPENMVTSGPFKLKKRLPNEKIIFSKNERYYNAKEVELDELV"fITSDNDL 
TVYNMYKNNEIDAIFNSIPPDIVNEIKLQKDYYQHKSNAIYLYSFNTKIKPLDDARVREALTLAIDRETLTYKVLN 
DGTVPTREITPDLKNYNYGKKLALFDPEKSXKLLADAGYPNGKGFPMLTLKYNTNETHKKIAAFIQNQWKKILNIN 
LMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFTRENSQLASYGYSNLEFDKLIRESDLEKDPIKRKQ 
LLRKAESIIIEKDFPAAPIYIYSGHYLFRNDKWTGWNPNVSEVYYLSELKPIKNAKHN 

f604.nt 

ATGAGCTTTAATAAAACTAAAAAAATCGGTAAAAAAATTAAAATAGTAACACTACTTATGCTTGCTGTGTCTTTAA 
TTGCATGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTCATCGCTTGA 
CCCTCATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAACAAATATTCTCAGGGCTTTTGACATTAAATA.ee 
AAAACAGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATTTTATC 
TAAGGGACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGATAAGAAAATCT^TTTTAAGAA^TTT 
AAATAAAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATTTTnACGGG 
AAAGTATCCGATTCTGAACTTGGAATCAAGGCAATTGATAC-TAAAACGCTGGAAATAACACTTACGGCCC^iX^GC 
CATATTTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGTGATTGAAAAATATAAGGGAAA 
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TTGGACAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTAAAAAAAAGATTACCTAATGAAAAAATTATC 
TTTGAAAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATG 
ATCTTACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGACATTGTAAA 
TGAAATAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAATA 
AAACCCCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTT TAACTTACAAAGTGC 
TAAATGATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTT 
ATTTGATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATGCTAACA 
CTAAAATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATTCAAAACCAATGGAAAAAAATTCTAAATA 
TCAATCTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAATAAG 
AGTTGGACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCA 
TCATACGGATATTCAAACCTAGAATTTGACAAACTCATCAGAGAATCAGATCTTGAAAAAGATCCTATAAAAAGAA 
AACAATTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGG 
GCATTATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTA 
AAACCAATTAAAAATGCAAAACATAATTAA 



TGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGC-GGGAGCGCCCTCATCGCTTGACCCTC 
ATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAACAAATATTCTCAGGGCTTTTGACATTAAATACCAAAAC 
AGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATTTTATCTAAGG 
GACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGATAAGAAAATCTTTTTTAAGAATTTTAAATA 
AAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATTTTGACGGGAAAGT 
ATCCGATTCTGAACTTGGAATCAAGGCAATTGATAGTAAAACGCTGGAAATAACACTTACGGCCCCAAAGCCATAT 
TTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGTGATTGAAAAATATAAGGGAAATTGGA 
CAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTAAAAAAAAGATTACCTAATGAAAAAATTATCTTTGA 
AAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATGATCTT 
ACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGAGATTGTAAATGAAA 
TAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAATAAAACC 
CCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTTTAACTTACAAAGTGCTAAAT 
GATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTTATTTG 
ATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATGCTAACACTAAA 
ATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATTCAAAACCAATGGAAAAAAATTCTAAATATCAAT 
CTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAATAAGAGTTG 
GACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCATCATA 
CGGATATTCAAACCTAGAATTTGACAAACTCATCAGAGAATCAGATCTTGAAAAAGATCCTATAAAAAGAAAACAA 
TTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGGGCATT 
ATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTAAAACC 
AATTAAAAATGCAAAACATAATTAA 



MKKVIILIFMLSTSLLYNCKNQDNEKIVSIGGSTTVSPILDEMILRYNKINNNTKVTYDAQGSSVGINGLFNKIYK 
IAISSRDLTKEEIEQGAKETVFAYDALIFITSPEIKITNITHENLAKILNGEXQNWKQVGGPDAKINFINRDSSSG 
SYSSIKDLLLNKIFKTHEEAQFRQDGIWKSNGSVIEKTSLTPHSIGYIGLGYAKNSIEKGLNILSVNSTYPTKET 
INSNKYTIKRNLIIVTNNKYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 



CKWQDNEKIVSIGGSTTVSPILDEMILRYNKINNNTKVTYDAQGSSVGINGLFNKIYKIAISSRDLTKEEIEQGAK 
ETVFAYDALIFITSPEIKITNITEENLAKILNGEIQNWKQVGGPDAKINFINRDSSSGSYSSIKDLLLNKIFKTHE 
FAQFRQDGIWKSNGEVIEKTSLTPHSIGYIGLGYAKNSIEKGLNILSVNSTYPTKETINSNKYTIKRNLIIVTNN 
KYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 



tS04.nt: 



f736.aa 



t736 .aa 



f736.nt 
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ATGAAAAAAGTTATTATCTTAATTTTTATGCTATCAACAAGTTTATTATACAACTGTAAAAATCAAGACAATGAAA 

AAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGATTTTAAGATATAATAAAATAAA 

CAATAATACTAAAGTAACATACGATGCACAAGGAAGTAGTGTTGGCATAAACGGGCTATTTAACAAAATATATAAA 

ATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGC-GCAAAAGAAACTGTATTTGCTTATGATG 

CTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATTACAGAAGAAAATCTAGCTAAAATACTAA ATGG 

AGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTATCAATCGAGACTCTTCTTCTGGT 

TCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCACGAAGAAGCTCAATTTAGACAAGACG 

GAATAGTGGTAAAATCTAATGC-AGAGGTAATTGAAAAAACAAGCCTTACTCCCCACTCAATAGGATATATAGGTCT 

TGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAGCACATATCCTACAAAAGAAACA 

ATAAATAGCAATAAATACACCATTAAAAGAAATTTAATAATAGTTACAAATAACAAATACGAGGATAAAAGCGTAA 

CTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAACAAGGCTTTTTAGGGATAAAAAC 
ATAA 

t736 .nt 

TGTAAAAATCAAGACAATGAAAAAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGA 
TTTTAAGATATAATAAAATAAACAATAATACTAAAGTAACATACGATGCACAAGGAAGTAGTGTTGGCATAAACGG 
GCTATTTAACAAAATATATAAAATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGGGCAAAA 
GAAACTGTATTTGCTTATGATGCTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATTACAGAAGAAA 
ATCTAGCTAAAATACTAAATGGAGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTAT 
CAATCGAGACTCTTCTTCTGGTTCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCA.CGAA 
GAAGCTCAATTTAGACAAGACGGAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAAAACAAGCCTTACTCCCC 
ACTCAATAGGATATATAGGTCTTGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAG 
CACATATCCTACAAAAGAAACAATAAATAGCAATAAATACACCATTAAAAGAAATTTAATAATAGTTACAAATAAC 
AAATACGAGGATAAAAGCGTAACTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAAC 
AAGGCTTTTTAGGGATAAAAACATAA 

• f752 .aa 

MNKKLNEVLLKLDQDLIKCVKGSLDLEISGVTYSSKLVLPRFVFFALPGIHFDGHDFIEIAIQKGSNWVCSRDVD ' 

FYSPNVTYIKVDDFNIRKFMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGVKVGFISTVFFDDGS 

GSLIKNPYRQSTPESTEIHSFLSTMVKNEAQYAILESTSHGLDLETARLIDVWYFAWFTNIGHEHLEFHGTIQMY 

LNVKLGLFRSVSDDAGFGVINLDDLYSSDFKNAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGVKYLAN^/ 

SLLGSFNVENVMAALILVSQILNIDIQDIVDKLNCIKSLDGRMDSINLGQNFSVIIDYAHTPGAFSKLFPIFKRFA 

TNRLISVFGSAGERDVEKRFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIVNKVENKDLFFIADRKQAIE 

KAISLAKAGDLWALGKGHESSIIYKNREVFWNEQEWKNAILSLEKSEKEK 

t752 .aa 

CVKGSLDLEISGVTYSSKLVLPRFVFFALPGIHFDGHDFIEIAIQKGSNVWCSRDVDFYSPNVTYIKVDDFNIRK 
FMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGVKVGFISTVFFDDGSGSLIKNPYRQSTPESTEI 
HSFLSTMVKNEAQYAILESTSHGLDLETARLIDVNYFAWFTNIGHEHLEFHGTIQNYLNVKLGLFRSVSDDAGFG 
VINLDDLYSSDFKNAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGVKYLANVSLLGSFWENVMAALILV 
SQILNIDIQDIVDKLNCIKSLDGRMDSINLGQNFSVIIDYAHTPGAFSKLFPIFKRFATNRLISVFGSAGERDVEX 
. RFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIVNKVENKDLFFIADRKQAIEKAISLAKAGDLVVALGKG 
HESS I IYKNREVFWNEQEWKNAILSLEKSEKEK 

f7 52.nt 

ATGAATAAAAAACTTAATGAAGTTTTATTAAAGTTAGATCAAGATTTAATAAAATGTGTAAAAGGTTCTCTTGATT 
TAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGTTTTTTGCTCTTCCAGGAATTCA 
TTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTGTTGTGTGTTC^CGAGATGTGGAT 
TTTTACAGTCCTAATGTTACTTATATTAAGGTAGATGACTTTAACATAAGAAAATTTATGTCTAATTTTTCAAATA 
TTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACTGGCACTGACGGGAAAAGTTCTG^TTG^TA 
TTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATCGACAGTATTTTTTGATGATGGGAGT 
GGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATACATTCATTTTTAAGCACCATGG 




• 
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TTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACCTTGAAACAGCAAGGCTTATTGA 
TGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATTTCATGGCACAATTCAAAATTAT 
TTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGGGTTATTAATCTTGATGACCTTT 
ATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAAGCAGTAAAGCGGATTTTTTTGT 
TAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGGGGTTAAATATCTTGCTAATGTT 
AGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTTTCTCAAATTTTAAATATCGATA 
TTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGGATAGTA.TTAATTTGGGGCAAAA 
TTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTTTCCTATTTTTAAAAGATTTGCT 
ACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAAAGATTTTTGCAAGGGCAAATCG 
CAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGAATAGTATGTGTATAATTAAAGA 
CATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTTTTTATTGCTGATAGAAAGCAGGCTATTGAA 
AAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGTCATGAAAGTTCAATAATTTATA 
AAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTTAAAAATGCTATTTTAAGTTTAGAAAAATCAGAAAAGGA 
GAAGTGA 



TGTGTAAAAGGTTCTCTTGATTTAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGT 
TTTTTGCTCTTCCAGGAATTCATTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTGT 
TGTGTGTTCACGAGATGTGGATTTTTACAGTCCTAATGTTACTTATATTAAGGTAGATGACTTTAACATAAGAAAA 
TTTATGTCTAATTTTTCAAATATTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACTGGCACTG 
ACGGGAAAAGTTCTGTTTGTTATTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATCGAC 
AGTATTTTTTGATGATGGGAGTGGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATA 
CATTCATTTTTAAGCACCATGGTTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACC 
TTGAAACAGCAAGGCTTATTGATGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATT 
TCATGGCACAATTCAAAATTATTTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGG 
GTTATTAATCTTGATGACCTTTATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAA 
GCAGTAAAGCGGATTTTTTTGTTAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGG 
GGTTAAATATCTTGCTAATGTTAGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTT 
TCTCAAATTTTAAATATCGATATTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGG 
ATAGTATTAATTTGGGGCAAAATTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTT 
TCCTATTTTTAAAAGATTTGCTACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAA 
AGATTTTTGCAAGGGCAAATCGCAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGA 
ATAGTATGTGTATAATTAAAGACATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTTTTTATTGC 
TGATAGAAAGCAGGCTATTGAAAAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGT 
CATGAAAGTTCAATAATTTATAAAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTTAAAAATGCTATTTTAA 
GTTTAGAAAAATCAGAAAAGGAGAAGTGA 



MVFHTYKHLELIMLPMLMLSCAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTI 
LGEDGKEIPEFKNKFGYSYIISPVKMDGKYSYYASLLILFETTKNGDDEYHIEDVKFVTAGSTLELKNSLLAVSNS 
QEEGYVTAyPFGILMSDEIKNAFKLTYKNGHWNYMLADLTVKNKLTQETKIYKISLNSKLIIEFLKEVLKENSILK 
DIAGDLFEDI 



CAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYI 
ISPVKMDGKYSYYASLLILFETTK^IGDDEYSIEDVKFVTAGSTLELKNSLLAVENSQEEGYVTAYPFGILMSDEIK 
NAFKLTYKNGHWNYMLADLTVKKKLTQETKIYKISLNSKLIIEFLKEVLKENSILKDIAGDLFEDI 



ATGGTATTTAGAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGA 
AACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACATTTAATATCAGGCAA 
AATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAACAAAGGCAATGACAATC 



t752 .nt 



f798.aa 



C798 . aa 



f798 .nt 
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TTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAATATCTCCTGTAAAAA 
TGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGA 
AATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCA 
CAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAA 
CATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAAT 
TTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGAAAATTCTATATTAAAA 
GACATAGCTGGAGATTTATTTGAAGATATATAA 

t7 9 8 .nt 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAAC 
AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 
ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 
GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 
AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 
AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTA 
CTCAAGAAACTAAAATOTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGA 
AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATATAA 

f805.aa 

MLRKLKDISKIVLVTDGLTPNCQTCGKLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAV 
QASSYNPTRILNIDKKGLICHGYDANLNVLDKDFNLKLTMIESKIIFNNL 

t805 .aa 

CQTCGXLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAVQASSYNPTRILNIDKKGLICH 
GYDANLNVLDKDFNLKLTMIESKIIFNNL 

fSOS.nt 

ATGCTTAGAAAGCTTAAAGATATAAGTAAAATAGTCCTTGTAACTGACGGACTTACTCCGAATTGTCAAACTTGTG 
GAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGAAAAGCAACACAAT 
AGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTCAGCTTAAGCGATGCTGTT 
CAAGCAAGCTCTTACAATCCAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCATGGATATGATGCAA 
ACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATCTAAAATAATTTTTAACAATCTCTA 
A 

t805.nt 

TGTCAAACTTGTGGAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGA 
AAAGCAACACAATAGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTCAGCTT 
AAGCGATGCTGTTCAAGCAAGCTCTTACAATCCAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCAT 
GGATATGATGCAAACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATCTAAAATAATTT 
TTAACAATCTCTAA 

f63 5.aa 

MKILWLIILVNLFLSCGNESKEKSNLGLRLRELEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFNLIG 
'LEFFKLGQYGPAIEYFAKNLSINPNNYLSHFYIGVASYNLAKNIjRVKDEVEKYIILAENSFLKSLSIRDDFKDSLF 

■ aisnmy/vydldkqleaknylnklgdmgedyfeflmlrganyyslgdlgnailfydkaskkasteeqkegvsrimsn 

LK 



t635 .aa 
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CGNESKEKSNLGLRLRELEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFNLIGLEFFKLGQYGPAIE V 
FAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEKYIILAENSFLKSLSIRDDFKDSLFAISNMYVYDLDKQLE 
AKNYLNKLGDMGEDYFEFLMLRGANYYSLGDLGNAILFYDKASKKASTEEQKEGVSRIMSNLK 

f 635 .nt 



ATGAAAATTTTGTGGTTAATAATTCTTGTTAATTTATTTTTATCTTGTGGCAATGAATCTAAAGAAAAATCAAATC 

TTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTGAATCTAAGATTGAAGTTTATAAAGAATTTAT 

TGAAAAAGAAGATAAGAATATTTTAAAGATAGTTAATTCCATTGATAAGAAAGCCAGATTTTTTAATTTAATTGGT 

CTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATATTTTGCTAAAAATTTAGAAATCAATCCCAATA 

ATTATTTATCTCATTTTTATATAGGTGTTGCTTCTTATAATTTAGCTAAAAATTTAAGAGTAAAAGATGAAGTTGA 

AAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTCAATTAGAGATGATTTTAAAGATTCTCTTT^T 

GCCATTTCTAATATGTACGTATATGATCTTGATAAACAACTTGAAGCTAAAAATTATTTAAATAAACTTGGTGATA 

TGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATTATTATTCGCTGGGCGATCTTGGTAA^GCTAT 

ATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCAAAAAGAAGGTGTTTCTAGGATCATGAGTAAT 
TTGAAGTAA 

t635 .nt 



TGTGGCAATGAATCTAAAGAAAAATCAAATCTTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTG 
AATCTAAGATTGAAGTTTATAAAGAATTTATTGAAAAAGAAGATAAGAATATTTTAAAGATAGTTAATTCCATTGA 
TAAGAAAGCCAGATTTTTTAATTTAATTGGTCTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATAT 
TTTGCTAAAAATTTAGAAATCAATCCCAATAATTATTTATCTCATTTTTATATAGGTGTTGCTTCTTATAATTTAG 
CTAAAAATTTAAGAGTAAAAGATGAAGTTGAAAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTC 
AATTAGAGATGATTTTAAAGATTCTCTTTTTGCCATTTCTAATATGTACGTATATGATCTTGATAAACAACTTGAA- 
GCTAAAAATTATTTAAATAAACTTGGTGATATGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATT 
ATTATTCGCTGGGCGATCTTGGTAATGCTATATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCA 
AAAAGAAGGTGTTTCTAGGATCATGAGTAATTTGAAGTAA 

f314.aa 

MNWCLIKFFIFLLVFSNSYVAFSKNVNVLIVTAMDSEFDQINKLiMSNKEEIVLKEYGLNKKILKGKLSNRNVMVII 
CGVGKVWAGVWTS Y I LSKYNI SHVINSGVAGGWSAKYKD I KVGD VWS SEVAYHDVDLTKFG YKVGQ LTGGLPQK 
FNANKNLIKNAIEAIKSKVGGSNAYSGLIVSGDQFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIR 
. SISDIVNKEGNEVEYSKFSKIAAFNSAKWQEILRKLZ 

t314.aa 

KNVNVLIVTAMDSEFDQINKLMSNKEEIVLKEYGLNKKILKGKLS^IMVIICGVGKWAGVWTSYILSKYNISH 
VINSGVAGGWSAKYKDIKA/GDWVSSEVAYHDVDLTKFGYKVGQLTGGLPQKFNANKNLIKWAISAIKSKVGGSN 
AYSGLIVSGDQFIDPTYINKIIGNFKDVIAVSMEGAAIGHVSHMFNIPFIVIRSISDIVNKEGNEVEYSKFSKIAA 
FNSAKWQEILRKLZ 

f.314.nt 

ATGAATAATTGTTTAATAAAGTTTTTTATTTTTTTATTAGTTTTTTCAAACAGTTATGTTGCTTTTTCTAAAAATG 
TCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTGATCAGATAAATAAGCTTATGTCTAATAAGGA^GAAAT 
AGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGAAGTTGTCTAATCGCAiiTGTT^GGTTATTATT 
TGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACATTTTGTCAAAATACAACATAAGTCATGTCATTA 
ATTCTGGCGTTGCTGGTGGCGTTGTTAGTGCTAAATACAAAGATATTAAAGTGGGAGATGTGGTGGTGTCTTCAGA 
GGTTGCATATCATGATGTTGATTTGACTAAATTTGGATACAAGGTAGGACAGCTTACAGGAGGATTGCCTCAAAAA 
TTTAATGCCAATAAAAATTTAATTAAGAATGCCATAGAGGCCATTAAATCAAAGGTTGGAGGTTCTAATGCATATT 
CAGGATTAATAGTTTCAGGAGATCAGTTTATTGATCCAACTTATATTAACAAAATTATAGGAAACTTTAAAGi^ rp GT 
AATAGCTGTTGAGATGGAAGGTGCAGCAATAGGGCATGTTTCTCATATGTTTAATATACCTTTTATAGTTATTAGG 
TC AATATC TGAC ATTGTAAATAAAGAAGGG AATGAGGTTGAAT AT AGTAAATTTTC TAAAAT AGC TGC TTT C AATT 
CAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 
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t314.nt 

AAAAATGTCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTGATCAGATAAATAAGCTTATGTCTAATAAGG 
AAGAAATAGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGAAGTTGTCTAATCGCAATGTTATGGT 
TATTATTTGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACATTTTGTCAAAATACAACATAAGTCAT 
GTCATTAATTCTGGCGTTGCTGGTGGCGTTGTTAGTGCTAAATACAAAGATATTAAAGTGGGAGATGTGGTGGTGT 
CTTCAGAGGTTGCATATCATGATGTTGATTTGACTAAATTTGGATACAAGGTAGGACAGCTTACAGGAGGATTGCC 
TCAAAAATTTAATGCCAATAAAAATTTAATTAAGAATGCCATAGAGGCCATTAAATCAAAGGTTGGAGGTTCTAAT 
GCATATTCAGGATTAATAGTTTCAGGAGATCAGTTTATTGATCCAACTTATATTAACAAAATTATAGGAAACTTTA 
AAGATGTAATAGCTGTTGAGATGGAAGGTGCAGCAATAGGGCATGTTTCTCATATGTTTAATATACCTTTTATAGT 
TATTAGGTCAATATCTGACATTGTAAATAAAGAAGGGAATGAGGTTGAATATAGTAAATTTTCTAAAATAGCTGCT 
TTCAATTCAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 

f 32 . aa 

MNTKTLYLISLILLACNKMSfKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEW 
KIEKTLEKTSRYGIEGNWILWJYKGTKRYIFSKDINIVNNLIIDHSKZ 

t32 -aa 

CNKMMKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEWKIEKTLEKTERYGIE 
GMWILVNYKGTKRYIFSKDINIVNNLIIDHSKZ 

f32 .nt 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTTAATAATTGATCATTCTAAATAG 

t3 2 .nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTA 
CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 
TAATAATTGATCATTCTAAATAG 

f320 .aa 

MKSIYxALLFLFINLSLLANNISKKDLEVLLKIA 

PEJ^KIPDLVNIDDFEDLKNLGAKTIJCVRKILIEDLIRLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYNVKTYGRK 
VAETQSAIPGHSQHHMGTAIDFINIDDNLLNTKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPWHYLYIGPKPC 
F I QKKYFNNLQHKLLEFWNQNKTNL INLI EKYANZ 

t320 .aa 

NNISKKDLEVLLKIAQAJyQNTKECKNFI^ 

KNLGAKTIKVRKILISDLIRLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYWKTYGRKVAETQSAIPGHSOHHMGT 
AIDFINIDDNLLNTKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPWHYLYIGPKPCFIQKKYFNNLQHKLLEFW 
NQNKTNLINLIEKYANZ 



f320 .nt 



TABLE 1. Nucleotide and Amino Acid Sequences 



ATGAAATCAATTTATGCTTTATTATTTCTATTTATTAATTTATCTTTGTTGGCTAACAACATTTCAAAAAAAGAT- 
TAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTATTGAAAAAAATCCTATTCAGTT 
CTTAAaAGAAATAAaACCCTTAGTAGATGCAGAAAAAAATAACCTCTTAACTCTAATAAATAAAAAAATACCAATT 
CCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTTAAAAATCTTGG^GCAAAGACT\ 

ttaaagtaagaaaaatattaatcgaagatttaattcgactaataaaagatgcaaaaaaatttgggattgaaattaa 

AATCAAATCTGCTTACAGAACGCAAGAATATCAAAAATTTTTATTTGATTACAATGTCAAAACTTATGGCAGAAAA 
GTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACAGCAATAGATTTTATAAATATAG 
ATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAAAATACGGATTTTCCGTTTCATA 

cccaaaaggatatgaaacggacactggatataaagcagagccttggcactacttatacataggacctaagccatgc 

TTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGGAACCAGAACAAAACAAATCTTA 
TTAACC TAATTGAAAAATATGCAAAC TAA 

t320 .nt 

AACAACATTTCAAAAAAAGATTTAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTA 
TTGAAAAAAATCCTATTCAGTTCTTAAAAGAAATAAAACCCTTAGTAGATGCAGAAAAAAATAACCTCTTAACTCT 
AATAAATAAAAAAATACCAATTCCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTT 
AAAAATCTTGGAGCAAAGACTATTAAAGTAAGAAAAATATTAATCGAAGATTTAATTCGACTAATAAAAGATGCAA 
AAAAATTTGGGATTGAAATTAAAATCAAATCTGCTTACAGAACGCAAGAATATCAAAAATTTTTATTTGATTACAA 
TGTCAAAACTTATGGCAGAAAAGTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACA 
GCAATAGATTTTATAAATATAGATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAA 
AATACGGATTTTCCGTTTCATACCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTT 
ATACATAGGACCTAAGCCATGCTTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGG 
AACCAGAACAAAACAAATCTTATTAACCTAATTGAAAAATATGCAAACTAA 

f342 .aa 



MLYLGDNKAMRTKIIIMTIIILLAPISGFSNSKESARGKFGAGIILPLPIALQINIGNFDLDIGLYSGVNNLFSDW 
KTLFIALDYIFYIYTFPGAANILDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLALNIAVFRKKFDIFLRIA 
PGLGMNVWSNGVGFRWEVFAGLGLRFWF T Z 

t342 .aa 

LAPISGFSNSKESARGKFGAGIILPLPIALQINIGNFDLDIGLYSGVNNLFSDWKTLFIALDYIFYIYTFPGAANI 
LDF SVGAGGYGT IWFSRFGGSKSGSGPMS IGARLPLALNI AVFRKKFDI FLRIAPGLGMNVWSNGVGFRWEVFAGL 
GLRFWFTZ 

f342 .nt 

ATGCTATACTTAGGAGATAATAAAGCAATGAGAACAAAAATAATTATTATGACAATTATTATTTTATTAGCCCCAA 
TCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATTATACTTCCATTACCAATTGC 
TCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAA^AA^TTC-TTTTCAGACTGG 
AAAACATTATTTATAGCATTAGACTATATTTTCTACATATACACATTCCCGGGAGCTGCTAATATTTTGGATTTTT 
CAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAGGCAGTAAGTCAGGCTCAGGACCAATGAG 
CATTGGAGCAAGATTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGACATATTTTTACGAATAGCA 
CCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTGGGACTAAGAT 
TCTGGTTTACTTAA 

t342.nt 



TTAGCCCCAATCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATT^TACT^CCAT 
TACCAATTGCTCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAATAATTTGTT 
TTCAGACTGGAAAACATTATTTATAGCATTAGACTATATTTTCTACATATACACATTCCCGGGAGCTGCTAATATT 
TTGGATTTTTCAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAGGCAGTAAGTCAGGC'T'CAG 
GACCAATGAGCATTGGAGCAAGATTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGACATATTTTT 
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ACGAATAGCACCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTG 
GGACTAAGATTC TGGTTTACTTAA 

f352 .aa 

MNKTKNRSLTYF 1 1 LSCISLFGANNNTISYSS IEI PLEDLSEEFKS SGNKSDQINTSKHLNKNIVSYEDPKKGKDL 
KLPSNIRDKKLPQKRMDENDLKSVISNYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKLKNEIV 
EIKKLLNKKIKPKEDENYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTXTLLMKSZ 

t352.aa - , 

CISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDLKLPENIRDKKLPQKRM 

DENDLKSVIEITCENKIKNIEKLLKTKNQKTS^ 

NYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ 

f352.nt 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 
ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTAAAAGTTCTGGGAATAAAAG 
CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAAAGATCTA 
AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATGATCTAAAATCTGTAATTG 
AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCAAAAATCAAAAAACATCGGAAAATGAAAA 
TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTTAACCAATAAATTAAAAAACGAAATAGTA 
GAAAT AAAAAAGC TC C TT AAC AAAAAAATC AAGC CT AAAGAAGATGAAAATTAC GAAAAAATAAAT ATTGAAAAC A 
TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAGGACAAT 
TACC CTTCTAATGAAGGAATAA 

t352 .nc 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 
AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTA 
TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATG 
GACGAAAATGATCTAAAATCTGTAATTGAjyiATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCA 
AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTT 
AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAA 
AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 
ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATGAAGGAATAA 

f301.aa 

MQIDGKIYSIISFPVRDSVSTLGVIGILICFDESLDIIENQLYSSLKFGSKKYNFFMLDRNYMPIFSNLNNLQAKS 
FSTAYSENFLSKVIAYAKKDSSSSQYTFNYEPJ3FYSLNFVKTDDFLTQGLILNVWSIPIMFKSNWIFVAFLLLSF 
AIIFYLCNTFVFSLINDFNRIVDYQKSKSDPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKIKFYSEDLN 
EYLEQIETAISNTESIDSSILVYEQLRDTFSRFEKSIVDILKGFESIADPINDHKKYISEISSNFEESVSFFYSID 
^LEIFNKVATINSTDIENIKSKVFDLNIVFENW^ 

GDAGKSFAWAEEIRKLAINSGKYSKTIKDELKTVDSIIAVINSEIDTIYKNFIDIQDNVDNNFSRHEKVDLTLAK 
HFKEIGEFKERYLSHDTKIRDAICTIMYKEIFNNHYFISGKFNNFSQDLKEFKVSKI^NLDAVSSLQEYSSLVKSSKDK 
ILKTKELIQKINDEIKDILFZ 



CFDESLDIIENQLYSSLKFGSKNYNFFMLDRNYMPIFSNLNNLQAKSFSTAYSENFLSKVIAYAKKDSSSSQYTFN 
YERDFYSLNFVKTDDFLTQGLILNWSIPIMFKS3MWVIFVAFLLLSFAIIFYLCNTFVFSLINDFNRIVDYQKSKS 
DPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKIKFYSEDLNEYLEQIETAISNTESIDSSILVYEQLRDT 
FSRFEKSIVDILKGFESIADPINDHNKYISEISSNFEESVSFFYSIDKNLEIFNKVATINSTDISNIKSKVFDLNI 
VFENVNKNFADLLSQTNS LQSVNKLL VS I SAQTNMLAMNAAI EAAKAGDAGKS FAWAEEI RXLAINSGKYSKT IK 
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DELKTVDSIIAVINSEIDTIYKNFIDIQDNVDN^ 

FNNHYFISGKFNNFSQDLKEFKVSKMNLDAVSSLQEYSSLVKSSKDKILKTKELIQKINDEIKDILFZ 
f301 .nt 

ATGCAAATAGATGGGAAAATTTATTCTATAATAAGTTTTCCAGTTAGAGATTCTGTTTCAACATTGGGTGTGATAG 
GGATTTTAATATGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAA 
AAATTATAATTTTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATCTTCAGGCCAAATCT 
TTTTCTACAGCTTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGT 
ACACTTTTAATTATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTAT 
TTTAAATGTCAATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTT 
GCAATTATTTTTTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAA 
AATCAAAAAGCGATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAG 
TTCAAAGCTAGATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAAT 
GAATATTTGGAACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAAC 
TAAGAGATACTTTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGAT 
TAATGATCACAATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGAT 
AAAAATTTAGAAATTTTTAATAAGGTTGCTACTATAAATTCTACTGATATTGAAAATATTAAAAGTAAGGTTTTTG 
ATTTAAATATTGTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGT 
AAATAAACTTTTAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCA 
GGTGATGCAGGTAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTA 
AAACCATTAAAGATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAA 
AAATTTCATAGACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAG 
CATTTTAAAGAAATTGGCGAGTTTAAAGAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGT 
ATAAAGAAATATTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAA 
AGTTTCTAAGATGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAG 
ATATTAAAGACAAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG 

t301 .nt 

TGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAAAAATTATAATT 
TTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATCTTCAGGCCAAATCTTTTTCTACAGC 
TTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGTACACTTTTAAT 
TATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTATTTTAAATGTCA 
ATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTTGCAATTATTTT 
TTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAAAATCAAAAAGC 
GATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAGTTCAAAGCTAG 
ATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAATGAATATTTGGA 
ACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAACTAAGAGATACT 
TTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGATTAATGATCACA 
ATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGATAAAAATTTAGA 
•AATTTTTAATAAGGTTGCTACTATAAATTCTACTGATATTGAAAATATTAAAAGTAAGGTTTTTGATTTAAATATT 
GTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGTAAATAAACTTT 
TAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCAGGTGATGCAGG 
TAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTAAAACCATTAAA 
.GATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAAAAATTTCATAG 
ACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAGCATTTTAAAGA 
AATTGGCGAGTTTAAAGAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGTATAAAGAAATA 
TTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAAAGTTTCTAAGA 
TGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAGATATTAAAGAC 
AAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG 



f346 .aa 
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MSIDKVPDEAFAEKIVGDGIAILPTSNELLAPCDGKIGKIFKTNHAFSLETKEGVSIFVHFGINTLNLNGKGFTRV 
AEEGINVKQGEVIIRLDLEYLKEHSESVITPWIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTK 
PVIAGKDLVLRVKKZ 

t346.aa 

CDGKIGKIFKTNHAFSLETKEGVEIFVHFGINTLNLNGKGFTRVAEEGINVKQGEVIIRLDLEYLKEHSESVITPV 
VIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTKPVIAGKDLVLRVKKZ 

f 346 .nt 

ATGTCAATTGATAAGGTTCCCGATGAAGCTTTTGCTGAAAAAATAGTTGGCGATGGAATTGCAATTCTTCCAACAA 
GCAATGAGTTGTTGGCGCCTTGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAAC 
TAAAGAGGGCGTTGAAATTTTTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTT 
GCTGAAGAGGGCATTAATGTTAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAG 
AATCCGTTATTACTCCGGTTGTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGA 
AAATGATTCTGAATATATTTTATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAG 
CCTGTTATAGCGGGCAAAGATTTGGTGTTGCGAGTTAAAAAGTAA 

t346 .nt 

TGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAACTAAAGAGGGCGTTGAAATTT 
TTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTTGCTGAAGAGGGCATTAATGT 
TAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAGAATCCGTTATTACTCCGGTT 
GTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGAAAATGATTCTGAATATATTT 
TATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAGCCTGTTATAGCGGGCAAAGA 
TTTGGTGTTGCGAGTTAAAAAGTAA 

f373 .aa 

MSryQRIKNYCKFTSVFLFFLFSCVSNELKLDQSLVKGKLWGLRYYIYKWQTPKN 

IAHYLEHMAFNGTKDYPGNS IVDVLKKFGMQFGADINAATSFDFTYYRLDLSDGNNKDE IDES INILRNWASQ ISF 
MKEExDLERNIIIEEKKLGETYPGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVI 
WGDIDPIEIEEKIKKQFVSWKNPTDKXKEVKVSLDVELKDKFLLLEDLEVGEPSLMFFKKEIINFVKTKDDLLNA 
IKKSLLAALFENRFSELKTAGVKQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHLNEGIQDFFYELERIRKFGF 
TQGELEKVRSQFYKSLELRKKNINKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTIN1SILVGREFD 
VKNCAIFYSYHGRAHPVLTLEDIDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVLENGV 
EVYFKYNDQKKGVIDFSATSWGGLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESY 
ISGSSDKKDLETLFQLIYFTFKEPKIDBVSLQNAINWIKALIKSNENSSDYHFHKAISKFLITONDPRFEDTKDSDL 
QYFTKENI L SF YKKRFT YANNFKF VLLETQ IFRQZ 

t373 .aa 

CVSNELKLDQSLVKGKLVNGLRYYIYKNQTPKNAVlSMGIVFiWGSLNEEDNERGIA 

DVLKKFGMQFGADINAATSFDFTYYRLDLSDGmfKJDEIDESINILRlWASQISFMKEEIDLERNIIIEEKKLGETY 
PGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVIWGDIDPIEIEEKIKKQFVSWK 
NPTDKIKEVKVSLDVELKDKFLLLEDLEVGEPSLMFFKKEIINFVKTKDDLLNAIKKSLLAALFENRFSELKTAGV 
KQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHLMEGIQDFFYELERIRKFGFTQGELEKVRSQFYKSLELRKKN 
INKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTINNLVGREFDVKNCAIFYSYHGRAHPVLTLED 
IDNLQKIALKRELKPYENSLISGKFFKKSLDDKDIIRENEFENEISSFVLENGVEVYFKYNDQKKGVIDFSATSWG 
GLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESYISGSSDKKDLETLFQLIYFTFK 
EPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAISKFLNWNDPRFSDTKDSDLQYFTKENILSFYKKRFTYAJSP^F 
KFVLLETQ IFRQZ 



f 3 73 .nt 
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ATGAATTATCAAAGAATTAAGAATTATTGTAAATTTACAAGCGTTTTTCTATTTTTTTTGTTTTCCTGTGTTTCTA 
ATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTTATAAAAATCA 
, AACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAATGAGAGGGGA 
ATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTTGATGTTCTTA 
AAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATAGACTTGATTT 
GTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCAAATCAGTTTC 
ATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCTTGGTGAGACTTATCCTGGAAGAA 
TTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTCTTTATGAATTTAGAAGTCCTATTGGACTTGAAGAGCA 
AATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACTTGCAAGTGTTATT 
GTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAAAATCCAACCG 
ATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAGATTTGGAAGT 
TGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATGACCTTTTAAATGCT 
ATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCTGGGGTAAAGCAATTTA 
AAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGATTTCTTTAAA 
CTTTAATCCAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGAAAATTTGGATTT 
ACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAATATAAATAAAA 
CAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATATGAATGAATA 
TTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAGAGAGTTTGAT 
GTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTGAAGATATTGACAATC 
TTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTGAAGGTAAATTTTTTAAGAAGTC 
TTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTGAAAATGAAATTTCGTCATTTGTTCTTGAAAATGGGGTT 
GAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGCAACTTCTTGGGGAGGTTTAATTA 
ATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCTGGTTCGGGTTATGGTGATTATTC 
TGCATTACAGATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGAGTTGGGGTTGGAGCTCAAGAATCATAT 
ATTTCTGGAAGTTC AGAT AAAAAAGATC TTGAAACTC TTTTTC AGC TTAT ATATTTT ACTTTTAAGGAACC C AAAA 
TTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAATAGTTCTGATTA 
TCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGATAGTGATTTG 
CAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAGGTTTACTTATGCAAATAATTTTAAGTTTGTCT 
TGCTGGAGACTCAGATATTCAGACAATAA 

t373 .nt 

TGTGTTTCTAATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTT 
ATAAAAATCAAACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAA 
TGAGAGGGGAATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTT 
GATGTTCTTAAAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATA 
GACTTGATTTGTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCA 
AATCAGTTTCATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCTTGGTGAGACTTAT 
CCTGGAAGAATTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTCTTTATGAATTTAGAAGTCCTATTGGAC 
TTGAAGAGCAAATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACTTGC 
AAGTGTTATTGTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAA 
AATCCAACCGATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAG 
ATTTGGAAGTTGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATGACCT 
TTTAAATGCTATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCTGGGGTA 
AAGCAATTTAAAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGA 
TTTCTTTAAACTTTAATCCAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGAAA 
ATTTGGATTTACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAAT 
ATAAATAAAACAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATA 
TGAATGAATATTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAG 
AGAGTTTGATGTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTGAAGAT 
ATTGACAATCTTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTGAAGGTAAATTTT 
TTAAGAAGTCTTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTGAAAATGAAATTTCGTCATTTGTTCTTGA 
AAATGGGGTTGAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGCAACTTCTTGGGGA 
GGTTTAATTAATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCTGGTTCGGGTTATG 
GTGATTATTCTGCATTACAGATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGAGTTGGGGTTGGAGCTCA 
AGAATCATATATTTCTGGAAGTTCAGATAAAAAAGATCTTGAAACTCTTTTTCAGCTTATATATTTTACTTTTAA.G 
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GAACCCAAAATTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAATA 
GTTCTGATTATCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGA 
TAGTGATTTGCAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAGGTTTACTTATGCAAATAATTTT 
AAGTTTGTCTTGCTGGAGACTCAGATATTCAGACAATAA 

f384 .aa 

MDWDFEKIIFLLNESTRLALSGCAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGSETISTYKEEYIKDA 
LISESTFIIDPIDGTSSFAAGLPSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNK 
FIFDNSKCYNIHSLLAVSRSIIRLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIAACLAIGNKLGMV 
GEF YCGNKMT LD I LDSMYILE PNNHKRWS LKDFFI YSDNKST I DI IRKDANKKINK 

t384.aa 

..CAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETISTYKEEYIKDALISESTFIIDPIDGTS5FAAGL 
PSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNKFIFDNSKCYNIKSLLAVSRSII 
RLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIAACLAIGNKLGMVGEFYCGNKMTLDILDSMYILEP 
NNHKRWSLKDFFIYSDNKSTIDIIRKDANKKINKZ 

f384.nt 

ATGGATTGGGATTTTGAAAAAATTATATTTTTATTAAATGAATCAACTA-GGCTTGCATTAAGTGGTTGTGCTAAAT 
TAATTTTAGATTTTAAATCTGATGGGTCTATTGT.AACTCAGGTTGATAAGCAAATTGAGCAATTCTTATTCAAAGA 
GATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGAGTATATCAAAGATGCT 
TTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCAGGCCTTCCTTCATATG 
GAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTTTAAGCGGAGAGTTTTT 
TATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAACATTGGTAGCTATCCTTTAAAAAAGGATTTTAATAAA 
TTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCTATTATAAGGTTATTTA 
ATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCTAAACTTTTTACAGGTTCTTA 
TAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTGCAGCGTGTTTAGCTATTGGTAATAAATTGGGCATGGTT 
GGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTTTAGAGCCTAATAATCATA 
AAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTATAAGAAAAGATGCAAA 
TAAAAAAATCAATAAGTAA 

t384.nt 

AGTGGTTGTGCTAAATTAATTTTAGATTTTAAATCTGATGGGTCTATTGTAACTCAGGTTGATAAGCAAATTGAGC 
AATTCTTATTCAAAGAGATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGA 
GTATATCAAAGATGCTTTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCA 
GGCCTTCCTTCATATGGAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTT 
TAAGCGGAGAGTTTTTTATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAACATTGGTAGCTATCCTTTAAA 
AAAGGATTTTAATAAATTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCT 
ATTATAAGGTTATTTAATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCTAAAC 
TTTTTACAGGTTCTTATAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTGCAGCGTGTTTAGCTATTGGTAA 
TAAATTGGGCATGGTTGGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTTTA 
GAGCCTAATAATCATAAAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTA 
TAAGAAAAGATGCAAATAAAAAAATCAATAAGTAA 

f 860 . aa 

MAFYKLNDNIALAEDLLKYLLSSILNECSQDMDFLENYIEKGLIKKLEI>TVINSNFEVITYTKAIEILENSKKNFEI 
KPYWGIDLQTDHERYLTEETFKKP^/WIDYPKNFKAFYMFANKDISIKTVKGMDILVPKIGEIIGGSEREDDLQKLEW 
RIKELNLNIEHLNWYLDLRRFGSAPHSGFGLGLERLVQYSTGISNIRDSIPFPRTPKNLYFZ 



t860 .aa 
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CSQDMDFLENYIEKGLIKKLENVINSNFEVITYTKAIEILENSKKNFSIKPYWGIDLQTDHERYLTEETFKKPVW 
IDYPKNFKAFYMKANKDNKTVKGMDILVPKIGEIIGGSEREDDLQKLENRIKELNLNIEHLNWYLDLRRFGSAPHS 
GFGLGLERLVQYSTGI SNIRDS IPFPRT PKNLYF Z 

f860 .nc 

ATGGCTTTTTATAAGCTTAACGACAATATTGCCCTAGCAGAAGATCTCTTGAAATATCTTTTAAGTTCAATTTTAA 
ACGAATGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAAGGTTTAATTAAAAAACTAGAAAATGTAAT 
AAATTCAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGAAAACTCAAAAAAAAATTTTGAAATA 
AAACCTTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAG 
TGGTCATTGATTATCCAAAAAATTTCAAAGCATTTTACATGAAAGCAAATAAAGACAATAAAACTGTTAAAGGAAT 
GGACATACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGACCTTCAAAAATTAGAAAAT 
AGAATAAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTC 
ATTCTGGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACC 
ATTCCCAAGGACTCCTAAAAATCTTTATTTTTAA 

t860.nt 

TGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAJ^GGTTTAATTAAAAAACTAGAAAATGTAATAAATT 
CAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGAAAACTCAAAAAAAAATTTTGAAATAAAACC 
TTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAGTGGTC 
ATTGATTATCCAAAAAATTTCAAAGCATTTTACATGAAAGCAAATAAAGACAATAAAACTGTTAAAGGAATGGACA 
TACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGACCTTCAAAAATTAGAAAATAGAAT 
AAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTCATTCT 
GGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACCATTCC 
CAAGGACTCCTAAAAATCTTTATTTTTAA 

f -44 6 . aa 

MKILRLCLLFLFFACTFDYDEYSSRSDVAKKFPSIQILGIKYYDWYNKEQTVLNSLSFSYFNDYKIYKAENGRFL 
YHSLDNEISGKFNNLEGSYITKDLDMRDSVEFKIEDKNNYYLLNSOTILLWKNKDKKLQSPPNELVLIRFNDSKING 
KGFSYFLKSNVFYFDSGVEGIMNZ 

t446 . aa 

CTFDYDEYSSRSDVAKKFPSIQILGIKYYDWYNKEQTVLNSLSFSYFNDYKIYKAENGRFLYHSLDNEISGKFNN 
LEGSYITKDLDMRDSVEFKIEDKNNYYLLNSmLLWKNKDKKLQSPPNELVLIRFTSIDSKINGKGFSYFLKSNVFYF 
DSGVEGIMNZ 

f44 6 .nt 

ATGAAAATACTTAGACTTTGTTTGTTGTTTTTGTTTTTTGCTTGTACTTTTGATTATGATGAGTATTCTAGTAGAT 
CTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAATCAAGTATTATGATGTTGTATACAATAAAGAGCA 
AACCGTTTTAAATTCTTTAAGCTTTAGTTATTTCAATGACTATAAAATTTATAAGGCAGAGAATGGAAGGTTTTTA 
TATCATTCCCTAGATAATGAAATTTCAGGGAAGTTTAATAATTTGGAAGGTTCTTATATTACAAAGGATTTGGATA 
TGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAATAATTATTATTTGCTTAATTCAAATAGGCTTTTATGGAA 
GAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGTATTAATTAGATTTAATGATAGCAAAATAAACGGA 
AAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTTGATTCTGGAGTTGAAGGAATCATGAATTGA 

t446 .nt 

TGTACTTTTGATTATGATGAGTATTCTAGTAGATCTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAA 
TCAAGTATTATGATGTTGTATACAATAAAGAGCAAACCGTTTTAAATTCTTTAAGCTTTAGTTATTTCAATGACTA 
TAAAATTTATAAGGCAGAGAATGGAAGGTTTTTATATCATTCCCTAGATAATGAAATTTCAGGGAAGTTTAATAAT 
TTGGAAGGTTCTTATATTACAAAGGATTTGGATATGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAATAATT 
ATTATTTGCTTAATTCAAATAGGCTTTTATGGAAGAATAAAGACAAGAA-GTTGCAATCCCCCCCAAATGAGCTAGT 



• # 
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ATTAATTAGATTTAATGATAGCAAAATAAACGGAAAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTT 
GATTC TGGAGTTGAAGGAATC ATGAATTGA 

£457. aa 

MKQKLSWILLFCFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPL 
FFNNLRYEIIGRKNISKGFEFEWIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDD 
FDYTEWHFFRWKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

t457 .aa 

CFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPLFFNNLRYEIIG 
RIOSriSKGFEFEVVIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDDFDYTEWHFFR 
WKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

f 457 .nt 

ATGAAGCAAAAATTAAGTTGGATTTTATTATTTTGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATG 
TTTTAATAGAGTTTTTTGATTCTATTAAAAATTTTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAAATATTCC 
AAGTGATGATGATCTGAAGGCAAAAATTCGTGGGTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTG 
TTTTTTAATAATCTAAGATATGAGATAATAGGTAGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTA 
AAAATATTAACTTTCAAAACGGTATAGAAAAATTTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTTAAATAT 
TAAAAATTTAGAAAAAAAAGAGCGTAAAAAAATATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGAT 
TTTGATTACACTGAAGTTGTTCATTTTTTTAGAGTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAG 
GAGATGTTTTAAATATACAGTCTAGAAATAAGCTTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 

t457.nt 

TGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATGTTTTAATAGAGTTTTTTGATTCTATTAAAAATT 
TTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAAATATTCCAAGTGATGATGATCTGAAGGCAAAAATTCGTGG 
GTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTGTTTTTTAATAATCTAAGATATGAGATAATAGGT 
AGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTAAAAATATTAACTTTCAAAACGGTATAGAAAAAT 
TTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTTAAATATTAAAAATTTAGAAAAAAAAGAGCGTAAAAAAAT 
ATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGATTTTGATTACACTGAAGTTGTTCATTTTTTTAGA 
GTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAGGAGATGTTTTAAATATACAGTCTAGAAATAAGC 
TTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 

f 542 .aa 

MRIVIFIFGILLTSCFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSS 
YVSDLDNLKRNGSDLIWLVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQ 
KKLFWQNRFYRGNEGZ 

t542 .aa 

" CFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSSYVSDLDNLKRNGSD 
LIWLVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQKKLFWQNRFYRGNE 
GZ 

f5 42 .nt 

ATGAGAATTGTAATTTTTATATTCGGTATTTTGTTGACTTCTTGCTTTAGTAGAAATGGAATAGAATCTAGTTCAA 
AAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACGACAAATCTTTTAATTCTAGTGCTAATGAGGCTTT 
ATTACGCTTGAAAAAAGATTTTCCAGAAAATATTGAAGAAGTTTTTTCTTGTGCTATTTCTGGAGTTTATTCTAGT 
TATGTTTCAGATCTTGATAATTTAAAAAGGAATGGCTCAGACTTGATTTGGCTTGTAGGGTACATGCTTACGGACG 
CATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATGGAATAATAGATCCCATTTATGGTGATGATGTTCA 
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GATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCCAAGGTGCTTTTTTGGCTGGCTATATTGCAGCCAA 
AAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAAGGGTAA 

t'5 42 .nt 

TGCTTTAGTAGAAATGGAATAGAATCTAGTTCAAAAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACG 
A.CAAATCTTTTAATTCTAGTGCTAATGAGGCTTTATTACGCTTGAAAAAAGATTTTCCAGAAAATATTGAAGAAGT 
TTTTTCTTGTGCTATTTCTGGAGTTTATTCTAGTTATGTTTCAGATCTTGATAATTTAAAAAGGAATGGCTCAGAC 
TTGATTTGGCTTGTAGGGTACATGCTTACGGACGCATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATG 
GAATAATAGATCCCATTTATGGTGATGATGTTCAGATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCC 
AAGGTGCTTTTTTGGCTGGCTATATTGCAGCCAAAAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAA 
GGGTAA 

f93.aa 

MKRILAMHDISSMGRTSLTICIPVISSFNMQVCPFVTAVLSASTAYKKFEI^/DLTDHLEKFINIWKEQNEHFDILY 
.TGFLGSEKQQITIEKIIKLIKFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLN 
NKDDIIKAILNLDTKATWVTSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQA 
LEKTTKAI HL I I KES I KENVSKKEGVRI ENFLKNTFZ 

t93 .aa 

CIPVISSFNMQVCPFVTAVLSASTAYKKFEIVDLTDHLEKFINIWKEQNEHFDILYTGFLGSEKQQITIEKIIKLI 
KFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLNNKDDIIKAILNLDTKATVW 
TSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQALEKTTKAIHLIIKESIXENV 
SKKEGVRI ENFLKNTFZ 

f93.at 

ATGAAAAGAATTTTAGCAATGCATGATATTTCAAGCATGGGAAGAACATCTCTTACAATATGCATACCAGTAATAT 
CTTCGTTTAATATGCAAGTTTGTCCTTTTGTGACAGCTGTCCTTTCTGCTTCCACAGCTTATAAAAAATTTGAAAT 
AGTGGATTTAACCGATCATTTAGAAAAATTTATCAATATATGGAAAGAACAAAATGAGCACTTTGACATACTCTAT 
ACCGGATTTCTGGGAAGCGAAAAACAACAAATAACAATAGAGAAAATAATTAAATTAATAAAATTTGAAAAAATTG 
TAATTGATCCTGTGTTTGCTGACGATGGAGAAATTTACCCTATATTTGATAATAAAATAATTAGTGGATTTAGAAA 
AATCATAAAGTACGCAAACATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCAAAAGCTCAAAACTTAAC 
AACAAAGATGATATCATAAAAGCAATATTAAATCTTGATACAAAAGCGACGGTAGTTGTTACAAGCGTTAAAAGGG 
GAAATCTCTTGGGAAACATTTGCTACAATCCTAAAAACAAAGAATACTCGGAGTTTTTTTTAGAAGGATTAGAACA 
AAATTTCAGTGGAACAGGAGATTTATTTACCAGCTTACTTATAGGATATTTGGAAAAATTTGAAACAGAGCAAGCC 
TTAGAAAAAACAACAAAGGCTATTCACCTAATAATAAAAGAGTCAATTAAAGAAAATGTTTCAAAAAAAGAAGGGG 
TC CGAATTGAAAATTTCTTAAAAAATAC ATTTTGA 

t93 .nt 

TGCATACCAGTAATATCTTCGTTTAATATGCAAGTTTGTCCTTTTGTGACAGCTGTCCTTTCTGCTTCCACAGCTT 
ATAAAAAATTTGAAATAGTGGATTTAACCGATCATTTAGAAAAATTTATCAATATATGGAAAGAACAAAATGAGCA 
CTTTGACATACTCTATACCGGATTTCTGGGAAGCGAAAAACAACAAATAACAATAGAGAAAATAATTAAATTAATA 
AAATTTGAAAAAATTGTAATTGATCCTGTGTTTGCTGACGATGGAGAAATTTACCCTATATTTGATAATAAAATAA 
TTAGTGGATTTAGAAAAATCATAAAGTACGCAAACATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCAA 
AAGCTCAAAACTTAACAACAAAGATGATATCATAAAAGCAATATTAAATCTTGATACAAAAGCGA.CGGTAGTTGTT 
ACAAGCGTTAAAAGGGGAAATCTCTTGGGAAACATTTGCTACAATCCTAAAAACAAAGAATACTCGGAGTTTTTTT 
TAGAAGGATTAGAACAAAATTTCAGTGGAACAGGAGATTTATTTACCAGCTTACTTATAC-GATATTTGGAAAAATT 
TGAAACAGAGCAAGCCTTAGAAAAAACAACAAAGGCTATTCACCTAATAATAAAAGAGTCAATTAAAGAAAATGTT 
TCAAAAAAAGAAGGGGTCCGAATTGAAAATTTCTTAAAAAAT AC ATTTTGA 



fl05 .aa 
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Z^GLYLKLLRQSINLKSLFPLSVLFFSCNVVDTDFSVLEFKVANFNLNDDFSQGLLDSAYNILNRSFDLIIIKNLKN 
K3WLDLINNRVLFRAFKNAYFIDQGS 

DFIKSIANLQISEQILYLKAQMDKLMFILDESEFVIFDLLIKNGFFSLINDSNYTSMLANKIDFRVFSNFFARVSL 
YSFMFVIADYLHSNYWENFPQKIVINZ 

-105. aa 

CNVVDTDFSVLEFKVANFNLNDDFSQGLLDSAYNILNRSFDLII^ 

GLSVSILSKRKINIKVLSVMQDSCDLKLGLLVDFKFENNHYGIVIYNLSKDFIKSIANLQISEQILYLKAQMDKLM 
FILDESEFVIFDLLIK^GFFSLINDSJSWTSMLANKIDFRVFSNFFARVSLYSFMFVIADYLHSNYWENFPQKIVI 
NZ 

fl05 .nt 

ATGGGCTTGTATTTGAAGTTGTTGAGACAAAGTATCAACTTGAAGAGTTTATTTCCGCTTAGTGTTTTATTTTTTT 
Q CCTGTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGATTTTTC 

■TCAAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAAT 
- : l AAAAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATCAAGGTA 
l|,J GTGGCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAAGT.TTTAAGTGTAATGCAAGATTCTTGCGA 
Q' TTTAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAG 
1 11 GATTTTATTAAAAGTATTGCCAATTTGC AAATTAGTGAAC AAATTTTATATTTAAAAGCCC AAATGGATAAATTGA 

1 ij TGTTTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGCTTAATAAATGA 
{m s TTCAAACTACACTTCAATGTTAGCAAATAAAATTGATTTTAGAGTTTTTTCTAATTTTTTTGCTAGGGTTTCTTTA 
r TATTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTA 
Q TCAATTGA 

\ tlOS.nt 

,2 TGTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGATTTTTCTC 
]'3 AAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAATAA 
AAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATCAAGGTAGT 
GGCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAAGTTTTAAGTGTAATGCAAGATTCTTGCGATT 
TAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAGGA 
TTTTATTAAAAGTATTGCCAATTTGCAAATTAGTGAACAAATTTTATATTTAAAAGCCCAAATGGATAAATTGATG 
TTTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGCTTAATAAATGATT 
CAAACTACACTTCAATGTTAGCAAATAAAATTGATTTTAGAGTTTTTTCTAATTTTTTTGCTAGGGTTTCTTTATA 
TTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTATC 
AATTGA 

f!50.aa 

MKTFVIIGLSNLGIHLLEDLSRLDCQIIIIDTSKELIEEYDVISTESFWEQFTKNALKRIIPVDTDAWIDFDDD 
LGKSALVTHYCNLLGLKEICVKTENRDDAEILKTLGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETV 
IPKEWGKTLFEADLRRECGITVIAVRNLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSK 
EDENLNKDAEKKSRFLGIFNFMKIFQKDRKDNZ 

tl50.aa 

CQIIIIDTSKELIEEYDVISTESFWEQFTKNALKRIIPVDTDAWIDFDDDLGKSALVTHYCNLLGLKEICVKTE 
NRDDAEILKTLGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETVIPKEYVGKTLFEADLRRECGITVI 
A^/RNLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSKEDENLNKDAEKKSRFLGIFNFMKI 
FQKDRKDNZ 

f 150 .r.t 
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ATGAAAACATTTGTTATTATTGGACTTAGTAATTTAGGCATTCACTTACTTGAAGATTTAAGCAGGCTTGATTGTC 
AAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTGTTGA 
GCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGATGAT 
CTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTTTTAGGTTTGAAAGAAATATGCGTTAAGACAGAAAATA 
GAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAAGATT 
AACTCCATTATTAGTATGTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAACTGTT 
ATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATTGCTG 
TTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATGATAAAATTGTAAT 
TTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAATTAAAGATTTAATTTCAGGCTCTAAA 
GAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATTTTTC 
AAAAAGATCGTAAGGATAATTAG 



TGTCAAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTG 
TTGAGCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGA 
TGATCTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTTTTAGGTTTGAAAGAAATATGCGTTAAGACAGAA 
AATAGAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAA 
GATTAACTCCATTATTAGTATCTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAAC 
TGTTATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATT 
GCTGTTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATGATAAAATTG 
TAATTTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAATTAAAGATTTAATTTCAGGCTC 
TAAAGAGGA-TGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATT 
TTTCAAAAAGATCGTAAGGATAATTAG 



MLIAEIMNINTLFYGMIIIIFALISCNHKNIQYDKRIKKFLDKNKIEYKIDSENDFIAFKDINKMEKEEVIIRSRL 
NSYKNSKIREIFGIVKVFDINTPKIKEISDSLMSDSYNWRVFGSWEIIHNAERGINSLVYIVKAEEFANDTFLLDA 
IDEIASTISIFKKIITTNNENIDNNEENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 



CNH'KNIQYDKRIKKFLDKNKIEYKIDSENDFIAFKDINlvINEKEEVIIRSRLNSYKNSKIRErFGIVKVFDINTPKI 
KEISDSLMSDSYl^RVFGSWEIIHNAERGINSLVYIVKAEEFANDTFLLDAIDEIASTISIFKKIITTNNENIDNN 
EENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 



ATGCTAATTGCAAGAATAATGAATATTAATACATTATTCTACGGCATGATCATTATCATTTTTGCACTCATTTCTT 
GCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAGA 
CTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATCATCAGATCAAGACTA 
AACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCAAAAATAA 
AAGAAATATCTGACTCGCTTATGAGCGATAGTTATAATAACAGAGTATTTGGATCGTGGGAGATTATTCATAATGC 
AGAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACATTTTTGCTTGATGCA 
ATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAATAATG 
AAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAATC 
TAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 



TGCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAG 
ACTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATCATCAGATCAAGACT 
AAACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCAAAAATA 
AAAGAAATATCTGACTCGCTTATGAGCGATAGTTATAATAlACAGAGTATTTGGATCGTGGGAGATTATTCATAATG 
CAGAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACA.TTTTTGCTTGATGC 



tl5 0 .nt 



f2-19 .aa 



t219 .aa 



f219 .nt 



t219 .nt 
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AATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAATAAT 
GAAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAAT 
CTAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

£229. aa 

MRVDLLPLVELSLYINLSFCCXDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYIN 
LELLEEFTLEIIPGYVDFEKFKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKLNILINNIRKFPFDLNIDMT 
WMPLQKKSHLKRDLQRI AF I YAZ 

t229 .aa 

CKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYINLELLEEFTLEIIPGYVDFEK 
FKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKLNILINWIRKFPFDLNIDMTVNMPLQKKSHLKRDLQRIAF 
IYAZ 

£229. nt 

ATGAGAGTAGATCTTTTACCTCTTGTCGAGTTAAGTCTTTATATTAATTTGTCATTTTGTTGTAAAGATTTTAGCA 
TTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTATAAAAACACTTTACAT 
TAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTTGTCCAAGTATATTAAT 
TTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAATTCAAACTTTTGC-ATG 
AATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGATTGTGGGGATACCCGA 
AATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTTGAATATTGACATGACT 
GTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTCATATATGCCTGA 

t229.nt 

TGTAAAGATTTTAGCATTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTA 
TAAAAACACTTTACATTAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTT 
GTCCAAGTATATTAATTTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAA 
TTCAAACTTTTGGATGAATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGA 
TTGTGGGGATACCCGAAATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTT 
GAATATTGACATGACTGTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTC 
ATATATGCCTGA 

f22.aa 

mlktltkiitisclivgcaslpytppkqnlnylmellpganlyahvnliknrsiynslspkyksvlglisnlyfsy 
kkemmdfallimgnfpkdifwgihknrntes 

KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22 .aa 

CAS LP YT P PKQNLNYLMEL LPGANL YAHVHL IKNRS I YNSLS PKYKS VLGL I SNLYF S YKKENNDF ALLIMGNF PK 
DIFWGIHKNFJSTTESIGNIFTNPKWKLKNSN^ 

'PTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIK 
DQNT V EIEFNIQKS S VE S L I EK L ASNI QT 

f22.nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 
CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAAAA 
CAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTTTAGCTAT 
AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGGGGAATTCATAAAA 
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ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 
TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAATATGCTAACAACA 
AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTATTGCTCCCAAACCAAA 
TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 
TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAATTCCAACCGTCTTG 
ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAAATATTCAAACCTAA 

t22.nt 

TGCGCAAGCCTGCCTTACACTCCTCCAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTAT 
ACGCCCATGTAAATTTAATTAAAAACAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCT 
T ATAAGC AATT TAT AC TTTAGC T AT AAAAAAGAAAAT AACGATTTTGC TCTACTAATAATGGGTAATTTCC CAAAA 
GATATTTTCTGGGGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAAC 
TTAAAAATTCAAATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGC 
AAAAGACAATAATATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGAT 
CCAACATTATTGCTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAA 
ACAGCTTAAATCAAGAAGAATATATTTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTC 
AAAAAAGTTAATTCCAACCGTCTTGACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAA 
GACCAAAATACGGTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAA 
ATATTCAAACCTAA 

f32 .aa 

MNTKTLYLISLILLACNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSSLDYDYAILLRKDEW 
KIEKTLEKTERYGIEGNWILVNYKGTKRYIFSKDINIVNNLIIDHSK 

t32 .aa 

CNKNNK IPLIQKLDLPKSSILGF SNKMG III KDYAF LSKS TKKNS ELD YD YAI L LRKDEWKI EKT LEKTERYG I E 
GNWI L VNYKGTKR Y IFSKD INIVNNL 1 1 DHS K 

£3 2 .nt 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTTAATAATTGATCATTCTAAATAG 

t3 2.nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTA 
CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 
TAATAATTGATCATTCTAAATAG 

fl86.aa 

MKKLIIIFTLFLSQACNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTL 
FGTTPMQRIHKYDQSFNLTREILASGIELNR^AMAWLNSPSHKEALINTDTDKIGGYRLXTTDNIDIFWLFGKRK 
YKN 



tl86 .aa 
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CNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTLFGTT 
PMQRIHKYDQSFNLTREILASGIELNRWTMAWLWSPSHKEALINTDTDKIGGYRLKTTDNIDIFWLFGKRKYKN 

f 186 .nt 

ATGAAAAAATTGATTATAATTTTTACACTGTTTTTATCTCAAGCATGCAATTTAAGTACAATGCATAAAATAGATA 
CAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAAAAAAATTAAATCTAAACCATCTAGAAAT 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATAGAACAATAACTCACACCCTT 
TTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACAAGAGAAATACTGGCATCAG 
GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAGCTCTTATTAATACAGATAC 
CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGTTCTTTTTGGAAAAAGAAAA 
T AT AAGAATT G A 

tl86.nt 

TGCAATTTAAGTACAATGCATAAAATAGATACAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGA 
GAAAAAAATT AAATCTAAACC ATC TAGAAAT AGATGAT ACC C TTGAAAAAGTTGC AAAAGAAT ATGC CATTAAACT 
GGGAGAAAATAGAACAATAACTCACACCCTTTTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCC 
TTTAATTTAACAAGAGAAATACTGGCATCAGGAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAA 
GCCACAAAGAAGCTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGA 
TATATTTGTAGTTCTTTTTGGAAAAAGAAAATATAAGAATTGA 

f216.aa 

MIRVLLGSLAVSFLFSICMVFLNYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFG 
FLLSDSRFLYSFLKNGVYYWLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVILVEGCI 
LKEQS 

t216 .aa 

CMVFLNYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFGFLLSDSRFLYSFLKNGV 
YYVNLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVILVEGCILKEQS 

f216 .nt 

ATGATTAGGGTGCTTTTGGGGTCTTTGGCAGTAAGCTTTTTGTTTTCTATTTGTATGGTTTTTTTAAATTATGATA 
ATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTGCTAATTTAAGATATTTAAGAGATGA 
ACAAAATTTGAAAGATAATTTAGATCTTTTAGTAAAAGATTTTCTTTTAGGAAGCAATGAAGGGTTTTCTTTTGGG 
TTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGTTTATTATGTAAATCTTTCAAGAGAAT 
TTTATGATTCTTTTAATAATGGTGATTATAATGAATCTAATGAATCTTTTGATGTTAAGGTCAATCTTTTTGCTAT 
GTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAAGATTGTTATTCTTGTTGAAGGGTGTATC 
TTAAAGGAGCAAAGTTGA 

t216.nt 

TGTATGGTTTTTTTAAATTATGATAATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTG 
CTAATTTAAGATATTTAAGAGATGAACAAAATTTGAAAGATAATTTAGATCTTTTAGTAAAAGATTTTCTTTTAGG 
AAGCAATGAAGGGTTTTCTTTTGGGTTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGTT 
TATTATGTAAATCTTTCAAGAGAATTTTATGATTCTTTTAATAATGGTGATTATAATGAATCTAATGAATCTTTTG 
ATGTTAAGGTCAATCTTTTTGCTATGTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAAGAT 
TGTTATTCTTGTTGAAGGGTGTATCTTAAAGGAGCAAAGTTGA 

f328 .aa 



MAIKYAREmiPFLGICLGLQLAVIEFARjSTVCGILDADTEENLARDKPLKSPVIHLLPEQKGIKDKGATMRLGGYP 
VILKKNTIAFKLYGQDRIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKjLIEIPENKFFVACQFHPELITR 
IENPAKLFLGLIKACI ■ 
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t328 .aa 

CLGLQLAVIEFARJWCGILDADTEENLARDKPLKSPVIHLLPEQKGIKDKGATMRLGGYPVILKKNTIAFKLYGQD 

RIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEIPENKFFVACQFHPELITRIENPAKLF 

LGLIKACI 



ATGGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTCTTGGAATTTGTCTTGGTTTGCAGCTTGCTGTAATAG 
AATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAAATTTAGCAAGAGACAAGCCCTTAAAAAG 
TCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGGCGCTACAATGAGGCTTGGTGGATATCCT 
GTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGATCGGATAATTGAAAGATTTAGACATAGGT 
ATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTATAGTATCTGGATTTTCAAGTGATTTTAA 
AATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTGCCAGTTTCATCCAGAACTTATTACAAGA 
ATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATTTGA 



TGTCTTGGTTTGCAGCTTGCT.GTAATAGAA'TTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAA 
ATTTAGCAAGAGACAAGCCCTTAAAAAGTCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGG 
CGCTACAATGAGGCTTGGTGGATATCCTGTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGAT 
CGGATAATTGAAAGATTTAGACATAGGTATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTA 
TAGTATCTGGATTTTCAAGTGATTTTAAAATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTG 
CCAGTTTCATCCAGAACTTATTACAAGAATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATT 
TGA 



MNKTKNRSLTYFIILSCISLFGAJSrNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDL 
KLPENIRDKKLPQKRMDENDLKSVIENYENKIKNIEKLLKTfasrQKTSENENKKIESIEKKAKKYEILTNKLKNEIV 
EIKKLLNKKIKPKEDENYEKINIENIEEETDDDFEDNYEYNDEIEXTNEDISnfPSNEGIINNLKENLNENEKYYAIN 
EKKIDELEDRINENENTI LDLQRELRNFKKKDNSDKNLE EIEENL S S IGRI INDLKRKI S ANEAINKENQKKI RTD 
KHKLKELEDKIKENEETILKLQKELNNFKKKEIYQKPLNEETFTPSITSKNDDLEENKKLKKEYLKPIEKKESRDL 
EENTKSTPKTTMIKTADFQIYPDIYLNNYKFKEKGDQFAFKKENTYYIEIDPTNNLNEALKNHEIISKYKFEKYFI 
NPILKNKEEFFRJNLIEVKNIHELGIKYKNLKPEFKQIKIIK 



CISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSXHLNKNIVSYEDPKKGKDLKLPSNIRDKKLPQKRM 
DENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKLKNEIVEIKKLLNKKIKPKEDE 
NYEKINIENIEEETDDDFEDNYEYNDEIEXTNEDNYPSNEGIINNLKENLNENEKYYAINEKKIDELSDRINENEN 
TILDLQRELRNFKKKDNSDKNLEEIEENLSSIGRI INDLKRKI SANEAINKENQKKIRTDKHKLKELEDKIKENEE 
TILKLQKELNNFKKKEIYQKPLNEETFTPSITSKNDDLEENKKLKKEYLKPIEKKESRDLEENTKSTPKTTMIKTA 
DFQIYPDIYLNNYKFKEKGDQFAFKKENTYYIEIDPTNNLNEALKNHEIISKYKFEKYFINPILKNKEEFFRNLIE 
VKNIHELGIMYKNLKPEFKQIKIIK 



ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 
ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATO'TAAAAGTTCTGGGAATAAAAG 
CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAAAGATCTA 
AAA-TTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATGATCTAAAATCTGTAATTG 
AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCAAAAATCAAAAAACATCGGAAAATGAAAA 
TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTTAACCAATAAATTAAAAAACGAAA'TAGTA 
GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAAAATTACGAAAAAATAAATATTGAAAACA 



f328 .nt 



t328 .nt 



f352 .aa 



t352 .aa 



f3 52 .nt 
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TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAGGACAAT 
TACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAATGAAAACGAAAAATATTATGCTATTAATG 
AAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACACTATTTTAGACTTGCAAAGAGAATTAAG 
GAATTTTAAAAAAAAAGATAACTCAGATAAAAACTTAGAAGAAATTGAGGAAAATTTATCTTCAATAGGAAGAATA 
ATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAATAAACAAAGAAAATCAAAAAAAAATAAGAACTGATA 
AACACAAACTCAAAGAATTAGAAGATAAAATAAAGGAAAATGAAGAGACTATTTTAAAACTTCAAAAAGAATTAAA 
CAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGAAGAAACTTTCACTCCAAGCATTACAAGTAAAAAT 
GAC GAC TT AG AAGAAAAT AAG AAATT AAAAAAGGAATATT T AAAGC CC AT AG AAAAAAAAG AAAGC C G AGAT C TAG 
AAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAGATTTTCAAATCTACCCTGACATATATCT 
TAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAAAGAAAACACATACTATATTGAAATAGAT 
CCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAAATAATCTCAAAATATAAATTTGAAAAATATTTCATTA 
ACCCTATTCTAAAAAATAAAGAAGAATTTTTTAGAAACTTAATAGAAGTCAAAAATATCCACGAACTAGGAATTAT 
GTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATAA 

t352 .nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 
AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTCAAAJ^CATTTAAACAAAAACATAGTTTCTTA 
TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATG 
GACGAAAATGATCTAAAATCTGTAATTGAAAATTATGAAJ^TAAAATTAAAAACATAGAAAAGCTTTTAAAAACCA 
AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTT 
AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAA 
AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 
ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAAT 
GAAAACGAAAAATATTATGCTATTAATGAAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACA 
CTATTTTAGACTTGCAAAGAGAATTAAGGAATTTTAAAAAAAAAGATAACTCAGATAAAAACTTAGAAGAAATTGA 
GGAAAATTTATCTTCAATAGGAAGAATAATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAATAAACAAA 
GAAAATCAAAAAAAAATAAGAACTGATAAACACAAACTCAAAGAATTAGAAGATAAAATAAAGGAAAATGAAGAGA 
CTATTTTAAAACTTCAAAAAGAATTAAACAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGAAGAAAC 
TTTCACTCCAAGCATTACAAGTAAAAATGACGACTTAGAAGAAAATAAGAAATTAAAAAAGGAATATTTAAAGCCC 
ATAGAAAAAAAAGAAAGCCGAGATCTAGAAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAG 
ATTTTCAAATCTACCCTGACATATATCTTAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAA 
AGAAAACACATACTATATTGAAATAGATCCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAAATAATCTCA 
AAATATAAATTTGAAAAATATTTCATTAACCCTATTCTAAAAAATAAAGAAGAATTTTTTAGAAACTTAATAGAAG 
TCAAAAATATCCACGAACTAGGAATTATGTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATA 
A 

f867 .aa 

MNTKGKWGWGNLVTIEVEGSVSMNEVLFVKTAGRNLKASVIRIRGNEVDAQVF 

ELGPGLLTQWDGLQNPLPELAIQCGFFLERGVYLRPLNKDKKWNFKKTSKVGDIVIAGDFLG?VIEGTVHHQIMI 
PFYKRDSYKIVEIVSDGDYSIDSQIAVIEDDSGMRHNITMSFHWPVKVPITNYKERLIPSEPMLTQTRIIDTFFPV 
AKGGTFCIPGPFGAGKTVLQQVTSRNADVDWIIAACGERAGEWETLKEFPELMDPKTGKSLMDRTCIICNTSSM 
PVAAREASVYTAITIGEYYRQMGLDILLLADSTSRWAQAMREMSGRLEEIPGEEAFPAYLESVIASFYERAGIWL 
NNGDIGSVTVGGSVSPAGGNFEEPVTQATLKWGAFHGLTRERSDARKFPAISPLESWSKYKGVIDQKKTEYARSF 
LVKGNEINQMMKWGEEGISNDDFLIYLKSELLDSCYLQQNSFDSIDAAVSSERQNYMFDIVYNILKTNFEFSDKL 
QARDFINELRQNLLDMNLSSFKDHKFNKLEHALGELINFKKVI 

t867.aa 

GRNLKAEVIRIRGNEVDAQVFELTKGISVGDLVEFTDKLLTVELGPGLLTQVYDGLQNPLPELAIQCGFFLERGVY 
LRPLNKDKKWNFKKTSKVGDIVIAGDFLGFVIEGTVHHQIMIPFYKRDSYKIVEIVSDGDYSIDEQIAVIEDDSGM 
RHNITMSFHWPVKVPITNYKERLIPSEPMLTQTRIIDTFFPVAKGGTFCIPGPFGAGKTVLQQVTSRNADVDWII 
AACGEPvAGEWETLKEFPELMDPKTGKSLMDRTCIICNTSSMPVAAREASVYTAITIGEYYRQMGLDILLLADSTS 
RWAQAMREMSGRLEEIPGEEAFPAYLESVIASFYERAGIWLNWGDIGSVTVGGSVSPAGGNFEEPVTQATLKWG 
AFHGLTRERSDARKFPAISPLESWSKYKGVIDQKKTEYARSFLVKGNEINQMMKWGEEGISNDDFLIYLKSELLD 
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SCYLQQNSFDSIDAAVSSERQNYMFDIVYNILKTNFEFSDKLQARDFINELRQNLLDMNLSSFKDHKFNKLEHALG 
ELINFKKVI 

f 867 .nt 



ATGAATACAAAAGGAAAAGTCGTTGGAGTTAATGGAAACTTAGTTACTATTGAGGTAGAAGGTTCAGTTTCTATGA 
ATGAAGTTTTATTTGTAAAGACTGCTGGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTGA 
TGCACAGGTTTTTGAATTGACAAAAGGGATATCTGTTGGAGACCTAGTTGAATTTACAGACAAACTTTTAACAGTT 
GAACTCGGACCAGGGCTTTTAACTCAAGTATATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTCAATGTG 
GATTTTTTTTAGAAAGGGGAGTATATTTAAGGCCCTTGAATAAAGATAAAAAGTGGAATTTTAAAAAAACCTCCAA 
AGTTGGAGATATCGTTATTGCAGGAGATTTTTTAGGTTTTGTAATTGAGGGAACTGTTCACCATCAAATAATGATT 
•CCATTTTATAAAAGGGATTCTTATAAAATTGTGGAGATTGTAAGTGATGGCGACTATTCGATTGATGAGCAAATTG 
CTGTAATTGAAGATGATTCTGGTATGAGGCATAATATTACAATGTCTTTTCATTGGCCTGTTAAAGTTCCTATTAC 
TAATTATAAGGAACGCCTTATTCCTAGTGAACCTATGTTGACTCAAACTAGAATTATAGATACATTTTTCCCAGTT 
GCCAAAGGTGGAACTTTTTGCATTCCGGGTCCTTTTGGAGCAGGAAAAACGGTTCTTCAGCAGGTTACAAGTCGAA 
ATGCTGATGTTGATGTAGTGATTATTGCAGCTTGTGGTGAGCGAGCAGGAGAAGTGGTAGAAACTCTTAAAGAATT 
TCCCGAATTAATGGATCCAAAAACCGGCAAATCTTTAATGGACAGGACTTGTATTATTTGTAATACATCTTCAATG 
CCAGTTGCAGCTAGAGAAGCTTCTGTTTATACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTTGATA 
'TTCTTCTTTTGGCAGATTCAACTTCAAGATGGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCC 
TGGCGAGGAGGCTTTTCCGGCATATCTTGAGTCTGTTATTGCTTCCTTTTATGAAAGGGCAGGTATTGTAGTTCTT 
AATAATGGGGATATTGGATCTGTAACAGTTGGTGGCTCTGTAAGTCCTGCTGGTGGTAATTTTGAAGAGCCAGTTA 
CTCAAGCAACTTTAAAAGTTGTAGGAGCATTTCACGGGCTTACAAGAGAAAGGTCTGATGCTAGGAAATTTCCAGC 
TATTAGTCCTCTTGAATCTTGGAGTAAATATAAAGGCGTTATTGATCAAAAAAAGACTGAATATGCAAGATCTTTT 
TTGGTGAAAGGTAATGAAATTAATCAAATGATGAAAGTTGTTGGAGAAGAAGGCATAAGTAACGATGATTTTTTAA 
TTTATTTAAAATCCGAGCTACTTGATTCGTGCTATTTGCAGCAAAATTCATTTGATTCTATTGATGCTGCTGTTAG 
TTCAGAGCGTCAAAATTATATGTTTGATATAGTTTATAACATTCTTAAAACTAACTTTGAGTTTTCTGATAAACTT 
CAAGCAAGAGATTTTATAAATGAGTTAAGGCAAAATCTTTTAGACATGAATCTTTCTTCTTTT.AAGGATCATAAGT 
TTAATAAATTGGAGCATGCTTTGGGTGAATTGATAAATTTTAAAAAGGTAATTTAG 

t867 .nt 

GGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTGATGCACAGGTTTTTGAATTGACAAAAG 
GGAT 

ATCTGTTGGAGACCTAGTTGAATTTACAGACAAACTTTTAACAGTTGAACTCGGACCAGGGCTTTTAACTCAAGTA 
TATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTCAATGTGGATTTTTTTTAGAAAGGGGAGTATATTTAA 
GGCCCTTGAATAAAGATAAAAAGTGGAATTTTAAAAAAACCTCCAAAGTTGGAGATATCGTTATTGCAGGAGATTT 
TTTAGGTTTTGTAATTGAGGGAACTGTTCACCATCAAATAATGATTCCATTTTATAAAAGGGATTCTTATAAAATT 
GTGGAGATTGTAAGTGATGGCGACTATTCGATTGATGAGCAAATTGCTGTAATTGAAGATGATTCTGGTATGAGGC 
ATAATATTACAATGTCTTTTCATTGGCCTGTTAAAGTTCCTATTACTAATTATAAGGAACGCCITATTCCTAGTGA 
ACCTATGTTGACTCAAACTAGAATTATAGATACATTTTTCCCAGTTGCCAAAGGTGGAACTTTTTGCATTCCGGGT 
CCTTTTGGAGCAGGAAAAACGGTTCTTCAGCAGGTTACAAGTCGAAATGCTGATGTTGATGTAGTGATTATTGCAG 
CTTGTGGTGAGCGAGCAGGAGAAGTGGTAGAAACTCTTAAAGAATTTCCCGAATTAATGGATCCAAAAACCGGCAA 
ATCTTTAATGGACAGGACTTGTATTATTTGTAATACATCTTCAATGCCAGTTGCAGCTAGAGAAGCTTCTGTTTAT 
ACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTTGATATTCTTCTTTTGGCAGATTCAACTTCAAGAT 
" GGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCCTGGCGAGGA.GGCTTTTCCGGCATATCTTGA 
GTCTGTTATTGCTTCCTTTTATGAAAGGGCAGGTATTGTAGTTCTTAATAATGGGGATATTGGATCTGTAACAGTT 
GGTGGCTCTGTAAGTCCTGCTGGTGGTAATTTTGAAGAGCCAGTTACTCAAGCAACTTTAAAAGTTGTAGGAGCAT 
TTCACGGGCTTACAAGAGAAAGGTCTGATGCTAGGAAATTTCCAGCTATTAGTCCTCTTGAATCTTGGAGTAAATA 
TAAAGGCGTTATTGATCAAAAAAAGACTGAATATGCAAGATCTTTTTTGGTGAAAGGTAATGAAATTAATCAAATG 
ATGAAAGTTGTTGGAGAAGAAGGCATAAGTAACGATGATTTTTTAATTTATTTAAAATCCGAGCTACTTGATTCGT 
GCTATTTGCAGCAAAATTCATTTGATTCTATTGATGCTGCTGTTAGTTCAGAGCGTCAAAATTATATGTTTGATAT 
AGTTTATAACATTCTTAAAACTAACTTTGAGTTTTCTGATAAACTTCAAGCAAGAGATTTTATAAATGAGTTAAGG 
CAAAATCTTTTAGACATGAATCTTTCTTCTTTTAAGGATCATAAGTTTAATAAATTGGAGCATGCTTTGGGTGAAT 
TGATAAATTTTAAAAAGGTAATTTAG 
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f 8 68 . aa 

MKRVYSKIESIAGNVITVTAQGIKYGELAIVKAKDTSSLAEVIKLDRSKVSLQVYGGTRGVSTSDEIKFLGHSMQV 
SFSDNLLGRIFDGSGNPRD3GPSLDDNLIEIGGPSANPTKRIVPRNMIRTGLPMIDVFNTLVESQKLPIFSVSGE? 
YNELLIRIALQAEVDLIILGGMGLKHDDYLTFKDSLEKGGALSRAIFFVHTANDSWESLTVPDISLSVAEKFALK 
GKKVLVLLTDMTNFADAMKEISITMEQVPSNRGYPGDLYSQLAYRYEKAIDFEGAGSITILAVTTMPGDDVTHPVP 
DNTGYITEGQYYLKGGRIEPFGSLSRLKQMVNSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKLLKY 
SNMF ESKMMDLS VTSf I PLEEALDLGWS I LASCF S PKETG IKTDL I EKYWPKKET Y 

t868.aa 

QGIKYGELAIVKAKDTSSLAEVIKLDREKVSLQVYGGTRGVSTSDEIKFLGHSMQVSFSDNLLGRIFDGSGNPRDG 
GPSLDDNLIEIGGPSANPTKRIVPRNMIRTGLPMIDVFNTLVESQKLPIFSVSGEPYNELLIRIALQAEVDLIILG 
GMGLKHDDYLTFKDSLEKGGALSRAIFFVHTANDSWESLTVPDISLSVAEKFALKGKKVLVLLTDMTNFADAMKE 
ISITMEQVPSNRGYPGDLYSQLAYRYEKAIDFEGAGSITILAVTTMPGDDVTKPVPDNTGYITEGQYYLKGGRIEP 
FGSLSRLKQMWSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKLLKYSNMFESKMMDLSVNIPLEEA 
LDLGWSILASCFSPKETGIKTDLIEKYWPKKETY 



ATGAAAAGAGTCTATAGTAAAATAGAGTCTATAGCAGGCAATGTAATAACTGTTACAGCTCAAGGTATTAAGTATG 
GTGAGCTTGC T ATTGTAAAAGC AAAAGAT AC AAGTT CTC TAGC CGAAGTAATTAAACTTGAT CGAGAAAAAGTTTC 
TCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGGGCATTCAATGCAGGTT 
TCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGGGAATCCTAGAGATGGGGGCCCTTCTCTTGATG 
ATAATTTGATTGAAATTGGTC-GGCCTTCTC-CAAATCCTACAAAACGCATTGTTCCTAGAAATATGATAAGGACAGG 
GCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAAAAATTGCCAATTTTTTCTGTTTCTGGTGAGCCT 
TATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGCGGAATGGGACTTAAGC 
ATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAATTTTTTTTGTTCATAC 
TGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGATATTTCACTTTCTGTTGCTGAAAAGTTTGCTCTAAAG 
GGCAAAAAAGTTTTGGTGCTTCTCACAGACATGACAAATTTTGCTGATGCAATGAAAGAAATATCTATTACAATGG 
AACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCGTTATGAGAAGGCTATTGA 
CTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATGTTACTCATCCTGTTCCT 
GACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAATAGAGCCTTTTGGGTCTCTTTCAA 
GACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCACAGGACTATAATGGATTCAATGATCAAGCTTTATGC 
ATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGATTTAATATGACTAAGTGGGATGAAAAATTGCTCAAGTAT 
AGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTAATATTCCTTTAGAAGAGGCTTTAGATTTAGGTTGGA 
GCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAAAATATTGGCCTAAAAA 
AGAGAC TTATTGA _ . 

t868 .nt 

CAAGGTATTAAGTATGGTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTCTAGCCGAAGTAATTAAACTTG 
ATCGAGAAAAAGTTTCTCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGG 
GCATTCAATGCAGGTTTCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGGGAATCCTAGAGATGGG 
GGCCCTTCTCTTGATGATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATCCTACAAAACGCATTGTTCCTAGAA 
ATATGATAAGGACAGGGCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAAAAATTGCCAATTTTTTC 
TGTTTCTGGTGAGCCTTATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGC 
GGAATGGGACTTAAGCATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAA 
TTTTTTTTGTTCATACTGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGATATTTCACTTTCTGTTGCTGA 
AAAGTTTGCTCTAAAGGGCAAAAAAGTTTTGGTGCTTCTCACAGACATGACAAATTTTGCTGATGCAATGAAAGAA 
ATATCTATTACAATGGAACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCGTT 
ATGAGAAGGCTATTGACTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATGT 
TACTCATCCTGTTCCTGACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGGAGAATAGAGCCT 
TTTGGGTCTCTTTCAAGACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCACAGGACTATAATGGATTCAA 
TGATCAAGCTTTATGCATCTTCAAAAGAGTCTGTAGAAAAAAA.GGCTATGGGATTTAATATGACTAAGTGGGATGA 
AAAATTGCTCAAGTATAGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTAATATTCCTTTAGAAGAGGCT 
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TTAGATTTAGGTTGGAGCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAA. 
AATATTGGCCTAAAAAAGAGACTTATTGA 

f872 f aa 

MRSAVLFFFALPFSISLYSSSHKNFPYWILLEKGRQFLYSKSEFSKSNLTHAINYLQEALLRKGVYPSASYYLSVA 

YGMSGNAILEKLNLYKSFEDRYYLLDESFEKKILFSLAKMAELENNYVDTIDYLNDILNKFSTKKDYYSYHDYSQG 

ENSMSNNELNASFYLTSYLKQVRGAFGIDFTFNLYRFKNYNVIDTHQLLSKVYLHLKAYELSITHGLIAAVGILTR 

MYDYVCYYEPVYQFKNLRSFVQKINKYKAIKNAFESTDFW 

SPYIAKSRSQIKNSVYLKKN 

t872 . aa 

SNKNFPYWILLEKGRQFLYSKSEFSKSNLTHAINYLQEALLRKGVYPEASYYLSVAYGMSGNAILEKLNLYKSFED 
RYYLLDESFEKKILFSLAKMAELENNYVDTIDYLNDILNKFSTKKDYYSYHDYSQGENSMSNNELNASFYLTSYLK 
QVRGAFGIDFTFNLYRFKNYNVIDTHQLLSKVYLHLKAYELSITHGLIAAVGILTRMYDYVCYYEPVYQFKNLRSF 
VQKINKYKAIKNAFESTDFWEIVYWAAATYAYSNGNYKFRAIDTWKLVVDLAPRFSPYIAKSRSQIKNSVYLKKN 

f 872 .nt 

ATGAGAAGTGCGGTTTTATTTTTTTTTGCTTTGCCTTTTTCTATTTCTTTGTATTCTTCAAGTAATAAAAATTTTC 
CGTATTGGATTTTACTTGAAAAAGGCAGGCAATTTCTTTATTCTAAATCTGAATTTAGTAAGTCTAATCTTACACA 
TGCTATTAATTATTTGCAGGAAGCTTTGCTTAGAAAAGGCGTTTATCCTGAGGCTAGTTATTATTTGTCAGTAGCT 
TATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCTTTATAAGTCTTTTGAAGACAGATATTATTTGCTAG 
ATGAATCTTTTGAAAAAAAAATACTTTTTTCTTTAGCTAAAATGGCTGAACTTGAGAATAATTATGTTGATACTAT 
TGATTATTTGAATGACATATTAAATAAGTTTTCAACTAAAAAAGATTATTATAGTTATCATGATTATTCTCAAGGC 
GAAAACAGTATGTCAAATAATGAACTTAATGCTTCATTTTATTTAACTTCTTATTTAAAACAAGTAAGAGGAGCTT 
TTGGTATTGATTTTACTTTTAATCTTTACAGATTTAAAAACTACAATGTTATTGATACTCATCAATTATTGTCAAA 
AGTTTATTTGCACTTAAAAGCTTATGAGCTTTCAATTACTCATGGACTTATAGCTGCAGTAGGAATTTTAACAAGA 
ATGTATGATTATGTTTGTTATTATGAACCTGTGTATCAGTTTAAAAATTTAAGGTCTTTTGTTCAAAAAATTAATA 
AGTATAAGGCAATAAAAAATGCTTTTGAATCTACAGATTTTTGGGAAATAGTTTATAATGTTGCTGCTGCTACTTA 
TGCATATTCTAATGGCAATTATAAATTTAGAGCAATAGATACTTGGAAATTAGTAGTAGATCTTGCGCCAAGGTTT 
TCTCCTTATATTGCTAAATCAAGAAGTCAAATTAAAAATTCTGTATATTTAAAAAAAAATTAA 

t872.nt 

AGTAATAAAAATTTTCCGTATTGGATTTTACTTGAAAAAGGCAGGCAATTTCTTTATTCTAAATCTGAATTTAGTA 
AGTCTAATCTTACACATGCTATTAATTATTTGCAGGAAGCTTTGCTTAGAAAAGGCGTTTATCCTGAGGCTAGTTA 
TTATTTGTCAGTAGCTTATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCTTTATAAGTCTTTTGAAGAC 
AGATATTATTTGCTAGATGAATCTTTTGAAAAAAAAATACTTTTTTCTTTAGCTAAAATGGCTGAACTTGAGAATA 
ATTATGTTGATACTATTGATTATTTGAATGACATATTAAATAAGTTTTCAACTAAAAAAGATTATTATAGTTATCA 
TGATTATTCTCAAGGCGAAAACAGTATGTCAAATAATGAACTTAATGCTTCATTTTATTTAACTTCTTATTTAAAA 
CAAGTAAGAGGAGCTTTTGGTATTGATTTTACTTTTAATCTTTACAGATTTAAAAACTACAATGTTATTGATACTC 
ATCAATTATTGTCAAAAGTTTATTTGCACTTAAAAGCTTATGAGCTTTCAATTACTCATGGACTTATAGCTGCAGT 
AGGAATTTTAACAAGAATGTATGATTATGTTTGTTATTATGAACCTGTGTATCAGTTTAAAAATTTAAGGTCTTTT 
'GTTCAAAAAATTAATAAGTATAAGGCAATAAAAAATGCTTTTGAATCTACAGATTTTTGGGAAATAGTTTATAATG 
TTGCTGCTGCTACTTATGCATATTCTAATGGCAATTATAAATTTAGAGCAATAGATACTTGGAAATTAGTAGTAGA 
TCTTGCGCCAAGGTTTTCTCCTTATATTGCTAAATCAAGAAGTCAAATTAAAAATTCTGTATATTTAAAAAAAAAT 
TAA 

f874.aa 

MLKSNKWLIGAGGVGSSFAYALTIDNSLVHELVIIDWENKAKGEWDLNHGQMFLXKNIWLFGTYKDCANADI 
WITAGLNQKPGETRLDLVDKNSKIFKDIITlW^vSSGFDGIFWASNPVDIMTYVTMKYSKFPIHKVIGTGTILDT 
SRLRYFLSDHFJWNTQNIHSYIMGEHXDSSFATWDETKIA^PLSEYLAEGKITELELDEIHKKVVNAAYEVIKLK 
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GATYYAIGLGIKNIVNAIIGDQNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSA 
WQLKSYIDKMEF 

t874.aa 

ALTIDNSLVHELVIIDWENKAKGEVMDLNHGQMFLKKNINVLFGTYKDCANADIWITAGLNQKPGETRLDLVDK 
NSKIFKDIITNWSSGFDGIFWASNPVDIMTYVTMKYSKFPIHKVIGTGTILDTSRLRYFLSDHFNVNTQNIHSY 
IMGEHXDSSFATWDETKIAMKPLSEYLAEGKITELELDEIHKKWNAAYEVIKLKGATYYAIGLGIKNIVNAIIGD 
QNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSANQLKSYIDKMEF 

f 874 ..nt 

ATGCTTAAGTCTAATAAAGTTGTTCTTATTGGAGCTGGTGGGGTTGGTTCAAGCTTTGCGTATGCTTTAACAATAG 
ACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGGTCATGGACCTTAA 
TCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGCTAATGCAGATATT 
GTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAAAATTCTAAAATTT 
TTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCAATCCTGTAGACAT 
TATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGACTATTCTTGATACT 
TCAAGACTTAGATATTTTTTAAGTGATCATTTTAATGTGAACACTCAAAATATACATTCATATATTATGGGTGAGC 
ACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATATCTTGCTGAAGGC 
AAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTTATTAAGTTAAAGG 
GGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATCAGAATGTTATTCT 
GCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGCGCCTGCTATAGTT 
TGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCCTAAAGAGCTTGATAAGTTTAATAGTTCTGCTA 
ATCAGCTTAAAAGCTATATTGATAAAATGGAATTTTAG 

t874.nt 

GCTTTAACAATAGACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGG 
TCATGGACCTTAATCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGC 
TAATGCAGATATTGTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAA 
AATTCTAAAATTTTTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCA 
ATCCTGTAGACATTATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGAC 
TATTCTTGATACTTCAAGACTTAGATATTTTTTAAGTGATCATTTTAATGTGAACACTCAAAATATACATTCA'TAT 
ATTATGGGTGAGCACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATA 
TCTTGCTGAAGGCAAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTT 
ATTAAGTTAAAGGGGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATC 
AGAATGTTATTCTGCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGC 
GCCTGCTATAGTTTGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCCTAAAGAGCTTGATAAGTTT 
AATAGTTCTGCTAATCAGCTTAAAAGCTATATTGATAAAATGGAATTTTAG 

f 886 .aa 

MKKKQLILLLFMPQIIYAKSYFASDVFFNKYQKLNEKPKTGFYIEYYSVDDTEKLYLYKENNLIKYKTIQIIENTK 
KITCYDTKDTKRKEEIYDNLNNKIQEIEYDSKGKTLETAHYVYENENLISKNLKTINQKPKLIYYSKDDNGKLLKI 
TGSNFQIMNYGINGDIKSTYFDIKKATTKVIKYDDKKRNSNSTIIWNKIKSKEKNQYLDEEKIWTFEEENTKII 
STYKANNLIKEETYKNNELIKVOTDFQYNESDMIIFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDIIYLKT 
EYHNDNEYEEEIYYNKKPALRVKHKNGKVTEEKPIGTN 

t885.aa 

SYFASDVFFNKYQKLNEKPKTGFYIEYYSVDDTEKLYLYKENNLIKYKTIQIIENTKKITCYDTKDTKRKEEIYDN 
LNNKIQEIEYDSKGKTLETANYVYENENLISKNLKTINQKPKLIYYSKDDNGKLLKITGSNFQIWNYGINGDIKST 
YFDIKKATTKVIKYDDKKRNSNSTIIVINM^ 

IKVNDFQYNESDMIIFQNTXEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDIIYLKTEYHNDNEYEEEIYYNKKPA 
LRVKHKNGKVTEEKPIGTN 
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f886.nt 

ATGAAAAAAAAACAATTAATACTTCTTCTATTTATGCCACAAATTATTTATGCAAAAAGCTATTTTGCATCTGATG 
TATTTTTCAATAAATACCAAAAATTAAATGAAAAACCAAAAACGGGGTTTTATATTGAGTATTATTCTGTTGATGA 
TACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCAAATCATAGAAAACACAAAA 
AAAATTACATGTTATGATACAAAAGATACAAAAAGAAAAGAAGAGATTTACGATAATTTAAATAACAAAATACAAG 
AAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATTACGTTTATGAAAACGAAAACTTAATATCTAA 
AJ^ATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAATGGTAAATTACTAAAAATA 
ACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAATGGCGACATAAAATCTACATATTTTGACATCAAAAAAG 
CAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTACAATAATTGTTAATAATAAAATAAA 
ATC C AAAG AAAAAAAC C AAT ATT TAGATGAAG AAAAAAT AGT AAAT AC C TTT G AAGAAG AGAAT AC AAAAAT CAT A 
TCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTTATAAAAGTAAATGATTTTC 
AATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACCAATACACCAATACTAAAAT 
TGAATACGAATATAACAAAGACAATCAATTAAAAAGCAAAAAAATTTATGAGAACGATATAATTTATCTAAAAACT 
GAATACCACAATGACAATGAATATGAAGAAGAAATATACTACAATAAAAAACCTGCTCTTAGGGTAAAACACAAGA 
ACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA 

t886 .rit 

AGCTATTTTGCATCTGATGTATTTTTCAATAAATACCAAAAATTAAATGAAAAACCAAAAACGGGGTTTTATATTG 
AGTATTATTCTGTTGATGATACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCA 
AATCATAGAAAACACAAAAAAAATTACATGTTATGATACAAAAGATACAAAAAGAAAAGAAGAGATTTACGATAAT 
TTAAATAACAAAATACAAGAAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATTACGTTTATGAAA 
ACGAAAACTTAATATCTAAAAATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAA 
TGGTAAATTACTAAAAATAACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAATGGCGACATAAAATCTACA 
TATTTTGACATCAAAAAAGCAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTACAATAA 
TTGTTAATAATAAAATAAAATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAATAGTAAATACCTTTGAAGA 
-AGAGAATACAAAAATCATATCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTT 
ATAAAAGTAAATGATTTTCAATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACC 
AATACACCAATACTAAAATTGAATACGAATATAACAAAGACAATCAATTAAAAAGCAAAAAAATTTATGAGAACGA 
TATAATTTATCTAAAAACTGAATACCACAATGACAATGAATATGAAGAAGAAATATACTACAATAAAAAACCTGCT 
CTTAGGGTAAAACACAAGAACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA 

f888.aa 

MEKLKLKLAIPLLVFTICKIHSQSNIEYNFSYIINTKKENIDLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIYI 
NQDIIKTELEISKLKKEiMDKKKLQNIITAKEKHNTKTKIDELKKNIQNINNKQKKFAEYFNNLKKLKVKYKKIEEQ 
TNISNLNKEFFIREELFFINYIDLKKIENYYLLEISNITPEKIETKKAVFKTSSSVNEIADHITKYSLKEILGREF 
LKINIWKNNSDAKIYINEKFVSKGIYHDNIFDISKLPWKEIEIQITSANFENYSIKRTVKNADSIILDIDLKRTI 
SKKVSIKSNVQSKVFKKGIFMGETPIEIEKPENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKKRLIDTRDK 
FYWLAVFTLSTIGAIFAGTLLNNSEVLYXITGNHFINKRLTAEDVYMAKAEQMTATFLFGVGITLTIGSFISLIT 
HLVEYI KEANMGE 

t888 .aa 

SNIEYNFSYIINTKKENIDLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIYINQDIIKTELEISKLKKEMDKKKL 
QNIITAKEKHNTKTKIDELKKNIQNINNKQKKFAEYFNNLKKLKVKYKKIEEQTNISNLNKEFFIREELFFINYID 
LKKIENYYLLEISNITPEKIETKXAVFKTSSSVNEIADHITKYSLKEILGREFLKININ^/KMNSDAKIYINEKFVS 
KGIYHDNIFDISKLPNKEIEIQITSANFENYSIKRTVKNADSIILDIDLKRTISKKVSTKSNVQSKVFKKGIFMGE 
TPIEIEKPENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKNRLIDTRDKFYVNLAVFTLSTIGAIFAGTLLN 
NSEVLYKITGNHFINKRLTAEDVYMAKAEQMTATFLFGVGITLTIGSFISLITKLVEYIKEANMGE 



f 888 .nt 
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ATGGAAAAGCTTAAACTAAAGCTAGCAATACCATTGCTAGTATTTACAATATGCAAAATACATTCTCAAAGTAATA 
TT GAATACAATTTTTCCTATATCATTAATACAAAAAAAGAAAATATTGACCTAAAAAAGGGTATTGAAAAACAATT 
GGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGAGCATCATTGAAGACATTTATATA 
AAT C AAGAT AT AAT AAAAAC AGAAC T TGAAAT T AGC AAATT AAAAAAAGAAATGG AT AAAAAAAAAC TTC AAAAC A 
TAATAACCGCAAAAGAAAAGCATAACACCAAAACCAAAATTGATGAGCTTAAAAAAAATATTCAAAATATTAACAA 
TAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAAATCGAAGAGCAA 
ACAAATATATCAAATTTAAATAAAGAATTTTTTATAAGAGAAGAATTATTTTTTATTAACTATATTGATCTTAAAA 
AAATAGAAAATTATTATTTGCTAGAAATTAGCAACATCACTCCTGAGAAAATTGAGACTAAAAAAGCGGTATTTAA 
AACATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCCTCAAAGAAATATTGGGCAGAGAATTT 
TTAAAAATCAACATTAACGTCAAAAATAACTCGGATGCAAAAATCTACATAAATGAAAAATTTGTTTCAAAAGGAA 
TCTATCACGiiTAATATTTTTGACATTTCTAAACTCCCAAACAAAGAAATTGAAATACAAATCACAAGTGCAAATTT 
CGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGACTCAATAATATTAGATATTGACTTAAAAAGAACAATC 
TCTAAAAAAGTATCAATTAAAAGCAATGTACAATCTAAAGTTTTTAAAAAAGGAATATTTATGGGAGAAACCCCAA 
TTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTGCTTAAATCTAAAGGATATAAAGATAAATTCAAGTTAAT 
AAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGACTTATCGACACAAGAGATAAA 
TTTTATG^CAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCCATTTTTGCAGGAACATTGCTTAACAATTCAG 
AAGTACTTTATAAAATAACAGGCAATCACTTTATTAACAAAAGATTAACAGCAGAAGATGTTTATATGGCAAAAGC 
GGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATCACTTTAACTATTGGAAGCTTTATCTCATTAATAACT 
CATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 



ZVGTAATATTGAATACAATTTTTCCTATATCATTAATACAAAAAAAGAAAATATTGACCTAAAAAAGGGTATTGAAA 
AACAATTGGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGAGCATCATTGAAGACAT 
TTATATAAATCAAGAT AT AATAAAAACAGAACTTGAAATT AGC AAATT AAAAAAAGAAATGGATAAAAAAAAACTT 
C AAAAC AT AAT AAC C GC AAAAGAAAAGCA.TAAC AC C AAAAC C AAAATTGATGAGC TTAAAAAAAAT ATT C AAAAT A 
TTAACAATAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAAATCGA 
AGAGCAAACAAATATATCAAATTTAAATAAAGAATTTTTTATAAGAGAAGAATTATTTTTTATTAACTATATTGAT 
CTTAAAAAAATAGAAAATTATTATTTGCTAGAAATTAGCAACATCACTCCTGAGAAAATTGAGACTAAAAAAGCGG 
TATTTAAAACATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCCTCAAAGAAATATTGGGCAG 
AGAATTTTTAAAAATCAACATTAACGTCAAAAATAACTCGGATGCAAAAATCTACATAAATGAAAAATTTGTTTCA 
AAAGGAATCTATCACGATAATATTTTTGACATTTCTAAACTCCCAAACAAAGAAATTGAAATACAAATCACAAGTG 
CAAATTTCGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGACTCAATAATATTAGATATTGACTTAAAAAG 
AACAATCTCTAAAAAAGTATCAATTAAAAGCAATGTACAATCTAAAGTTTTTAAAAAAGGAATATTTATGGGAGAA 
ACCCCAATTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTGCTTAAATCTAAAGGATATAAAGATAAATTCA 
AGTTAATAAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGACTTATCGACACAAG 
AGATAAATTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCCATTTTTGCAGGAACATTGCTTAAC 
AATTCAGAAGTACTTTATAAAATAACAGGCAATCACTTTATTAACAAAAGATTAACAGCAGAAGATGTTTATATGG 
CAAAAGCGGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATCACTTTAACTATTGGAAGCTTTATCTCATT 
AATAACTCATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

f893 -aa 

MVRFLGFLYLITTIPLIKSCDAAQFGDYKPLYFENENDLKTANEYINSLGYKTISEYTTKIDILDFPENKEITINE 

INKLNNLDLRKSIFLKKLSNLFNIEHKKLLYVENRFKSINFKNLKKELNINADIHSLDYKTKINFISSxIFLIIII 

LLIFLDPTNSIFTLIFLLISSLAFMISKEIMTYFYPFTVLSYLLFLIISNFNKjSTYNKIYLKEINFLTLMTKIKHLLF 

LFTFTALYFITITTFFTTNIDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAIH 

LLLFTLSLIPFAYSSYMLNSYENINYLYSKKLNYFD^^ 

YGKIPKDIKENYFEIKNDKIEIHPKTVYEVDKSFIDEILK^^ 

FSLPFFVLLFLFKAIRFTILLNIN 

EKTYKKYIQG 

t893.aa 

CDAAQFGDYKPLYFENENDLKTANEYINSLGYKTISEYTTKIDILDFPENKEITINEINXLNNLDLRKSIFLKKLS 
NLFNIEHKKLLYVENRFKSINFKNLKKELNINADIHSLDYKTKINFISSIIFLIIIILLIFLDPTNSIFTLIFLLI 
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SSLAFMISKEIMYFYPFTVLSYLLFLIISNFNKNYNKIYLKEINFLTLMTKIKHLLFLFTFTALYFITITTFFTTN 
IDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAIHLLLFTLSLIPFAYSSYMLN 
SYENINYLYSKKLNYFDYLNPNNIYIMLGYNKDMPNIIGYLSHILYQNELKYNITAKYGKIPKDIKEKnrTFEIKOTK 
IEIHPKTWEVDKSFIDEILKKDLASLFLKNKNPILIYKENKNNINTDKKKYKILFFFSLPFFVLLFLFKAIRFTI 
LLNINEKTYKKYIQG 

f893 .nt 

ATGGTGCGTTTTTTAGGTTTTTTATATTTAATTACAACAATACCACTTATCAAATCCTGTGATGCAGCTCAATTTG 
GAGACTACAAACCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGCCAATGAATATATAAATTCACTAGGATA 
C AAAAC AATCTC AGAAT AC AC AAC AAAAATTGAC ATTTTAGAC TTT CC C GAAAAT AAAGAAATC AC AAT AAATGAG 
ATAAACAAACTTAACAATCTTGACCTGAGAAAAAGCATATTTTTAAAAAAGCTCTCCAATCTTTTCAACATAGAGC 
ACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTTTAAAAACCTAAAAAAAGAACTCAATATTAA 
TGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTTTATTTCAAGCATAATATTTCTAATCATAATAATT 
TTATTAATTTTTTTAGACCCAACAAACTCTATATTTACTTTAATTTTTCTATTAATTTCATCTCTTGCTTTTATGA 
TAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTCTCTTATTTGTTATTTTTAATAATCAGTAATTTTAA 
CAAAAATTACAATAAAATATATTTAAAAGAAATAAATTTTTTAACACTAATGACAAAAATAAAACACTTACTATTT 
TTATTTACATTCACAGCTCTATATTTCATTACAATCACAACCTTTTTTACTACAAATATTGATCCCACTTTTATTG 
CATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTCAGCTGGATAAAAACAGAAAGCAATTTTAAAGACAC 
TTTCTTATTCCCAATCGAGATTAAAGAGAAAAAAATAGAAGGAAAAAAAGCTTTAAAATCAAAAATAGCAATACAT 
CTACTACTATTTACACTCTCATTAATTCCTTTCGCTTATTCAAGCTATATGCTAAATTCTTATGAAAACATTAACT 
ACCTTTACAGTAAAAAATTAAATTACTTTGATTATTTAAATCCTAATAACATTTATATAATGCTGGGATACAACAA 
AGACATGCCCAATATTATAGGGTACCTATCCCACATTCTTTATCAAAACGAACTAAAATACAATATTACCGCTAAG 
TATGGAAAAATTC CT AAAGAT AT AAAAGAAAATT AC TTTG AAATC AAAAACGAC AAAAT AGAAATTC ATC CTAAAA 
CTGTTTACGAAGTAGACAAATCATTTATTGATGAAATTCTTAAAAAAGATCTTGCAAGTCTGTTTTTAAAAAATAA 
AAATCCAATCCTAATATATAAAGAAAACAAGAATAATATCAACACAGATAAAAAAAATTACAAAATACTTTTCTTT 
TTCTCTTTGCCCTTCTTTGTATTACTATTCCTATTTAAAGCAATAAGATTTACAATTCTTTTAAACATAAATGAAA 
AAACCTATAAAAAATATATTCAAGGATAA 

t893 .nt 

TGTGATGCAGCTCAATTTGGAGACTACAAACCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGCCAATGAAT 
ATATAAATTCACTAGGATACAAAACAATCTCAGAATACACAACAAAAATTGACATTTTAGACTTTCCCGAAAATAA 
AGAAATCACAATAAATGAGATAAACAAACTTAACAATCTTGACCTGAGAAAAAGCATATTTTTAAAAAAGCTCTCC 
AATCTTTTCAACATAGAGCACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTTTAAAAACCTAA 
AAAAAGAACTCAATATTAATGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTTTATTTCAAGCATAAT 
ATTTCTAATCATAATAATTTTATTAATTTTTTTAGACCCAACAAACTCTATATTTACTTTAATTTTTCTATTAATT 
TCATCTCTTGCTTTTATGATAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTCTCTTATTTGTTATTTT 
TAATAATCAGTAATTTTAACAAAAATTACAATAAAATATATTTAAAAGAAATAAATTTTTTAACACTAATGACAAA 
AATAAAACACTTACTATTTTTATTTACATTCACAGCTCTATATTTCATTACAATCACAACCTTTTTTACTACAAAT 

attgatcccacttttattgcatttgtcgcaataccaaccctttgcattttcttaattttcagctggataaaaacag 
aaagcaattttaaagacactttcttattcccaatcgagattaaagagaaaaaaatagaaggaaaaaaagctttaaa 
'atcaaaaatagcaatacatctac.tactatttacactctcattaattcctttcgcttattcaagctatatgctaaat 

TCTTATGAAAACATTAACTACCTTTACAGTAAAAAATTAAATTACTTTGATTATTTAAATCCTAATAACATTTATA 

taatgctgggatacaacaaagacatgcccaatattatagggtacctatcccacattctttatcaaaacgaactaaa 
atacaatattaccgctaagtatggaaaaj^ttcctaaagatataaaagaaaattactttgaaatcaaaaacgacaaa 
atagaaattcatcctaaaactgtttacgaagtagacaaatcatttattgatgaaattcttaaaaaagatcttgcaa 
gtctgtttttaaaaaataaaaatccaatcctaatatataaagaaaacaagajvtaatatcaacacagataaaaaaaa 
ttacaaaatacttttctttttctctttgcccttctttgtattactattcctatttaaagcaataagatttacaatt 
cttttaaacataaatgaaaaaacctataaaaaatatattcaaggataa 

f 895 .aa 

miralltndlflsclvsgisaqvikygiqtvktrklkltpvhllkkifletggmpsshsstvtalstsialtegid 
tnfiialafalitirdsfgvrymsgvqasylnalseklkkeikidttkikwkghkkkevltgiiigivsayivcy 

F 
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t895 . aa 

AQVIKYGIQTVKTRKLKLTPVHLLKKIFLETGGMPSSHSSTVTALSTSIALTEGIDTNFIIALAFALITIRDSFGV 
RYMSGVQAEYLNALSEKLKKEIKIDTTKIKWKGHKKKEVLTGIIIGIVSAYIVCYF 

f 895 .nt 

ATGATAAGGGCATTGCTTACCAATGATCTTTTTTTGTCTTGTCTTGTATCAGGAATTTCTGCTCAAGTGATTAAAT 
ATGGTATCCAAACTGTAAAAACAAGAAA.GTTAAAACTAACTCCAGTACATCTTTTAAAAAAAATTTTTCTAGAAAC 
AGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACTAACTGAAGGAATAGAT 
ACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTAAGATATATGTCTGGAG 
TTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGAAATAAAAATTGACACAACAAAAATAAAAGT 
GGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTCTGCGTATATTGTGTGCTAT 
TTTTAG 

t895.nt 

GCTCAAGTGATTAAATATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTCCAGTACATCTTTTAAAAA 
AAATTTTTCTAGAAACAGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACT 
AACTGAAGGAATAGATACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTA 
AGATATATGTCTGGAGTTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGAAATAAAAATTGACA 
CAACAAAAATAAAAGTGGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTCTGC 
GTATATTGTGTGCTATTTTTAG 

f 605 .aa 

MYIGAAGKSFSIIIDSAFLSNCFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKYVCGSKYSPMRPTLII 
SKLPVFLLLVRTGQFSLVSIRLIFRIFFHWFZ 

t605.aa 

CFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKYVCGSKYSPMRPTLIISKL PVFLLLVRTGQFSLVSIR 
LIFRIFFHWFZ 

f605.nt 

ATGTATATTGGTGCAGCAGGAAAATCTTTTTCAATTATTATTGATTCTGCTTTTCTGAGTAATTGTTTTCTTTTTA 
TAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGTATGATGCAAGTTG 
TGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAATGCGTCCAACTCTTATTATT 
TCAAAATTGCGAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGATTGATATTTAGAA 
TTTTTTTCCATTGGTTTTGA 

t€05 .nt 

TGTTTTCTTTTTATAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGT 
ATGATGCAAGTTGTGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAATGCGTCC 
AACTCTTATTATTTCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGA 
TTGATATTTAGAATTTTTTTCCATTGGTTTTGA 

f 606 . aa 

MKLQRSLFLIIFFLTFLCCNNKERKEGVSFKISLGAEPSSLDPQLAEDNVASKMIDTMFRGIVTGDPNTGGNKPGL 
AKGWDISSDGTVYTFNLREKITWSDGVAITAEGIRKSYLRILNKETGSKYVTEMVKSVIKNGQKYFDGQVTDSELGI 
RAIDEKTLEITLESPKPYFIDMLVHQSFIPVPVH^TEKYGQtMTSPENMVTSGPFKLKERIPNEKYVFEKISlNKYYD. 
SNEVELEEITFYTTNDSSTAYKMYENEELDAIFGSIPPDLIKNLKLRSDYYSSAVNAIYFYAFNTHIKPLDNVKIR 
KALTLAIDRETLTYKVLDNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLAEAGYPNGNGFPILKLKYNTNEAN 
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KKICEFIQNQWKKNLNIDVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSHNYSNPEY 
NELIKKSDLELDPIKRQDILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

t606 . aa 

CCNNKERKEGVSFKISLGAEPSSLDPQLAEDWASKMIDTMFRGIVTGDPNTGGNKPGLAKGWDISSDGTVYTFNL 
REKITWSDGVAITAEGIRKSYLRILNKETGSKYVEMVKSVIKNGQKYFDGQVTDSELGIRAIDEKTLEITLESPKP 
YFIDMLVHQSFIPVPVHVTEKYGQNWTSPENMVTSGPFKLKER 

STAYKMYENEELDAIFGSIPPDLIKNLKLRSDYYSSAVNAIYFYAFNTHIKPLDNVKIRKALTLAIDRETLTYKVL 
DNGTTPTRFJ^TPNFSSYSYAKSLELFNPEIAKTLLAEAGYPNGNGFPILKLKYNTNEANKKICEFIQNQWKKNLNI 
DVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSHNYSNPEYNELIKKSDLELDPIKRQ 
DILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

f 606 .nt 

ATGAAATTACAAAGGTCATTATTTTTAATAATATTTTTTCTAACTTTTCTTTGTTGTAATAACAAGGAAAGAAAAG 
AAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACCCTCAATTAGCAGAGGATAATGTCGC 
ATC AAAAATGATTGAC AC AATGTTTAGAGGG ATTGTTACAGGAGAT C CT AATAC AGGGGGAAAT AAAC C GGGACTT 
GCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTAAGAGAAAAAATCACTTGGAGTGACG 
GAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAAATAAAGAAACTGGCTCAAAGTACGT 
TGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACAAGTGACTGACTCTGAACTTGGAATT 
AGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCTTATTTTATTGATATGTTAGTACACC 
AATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACTGGACAAGCCCCGAAAACATGGTGAC 
AAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTCTTTGAAAAAAATAACAAATACTACGAC 
TCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGCTCAACAGCGTATAAAATGTATGAAA 
ATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAAATCTAAAATTAAGAAGCGACTATTA 
CTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAAACCACTTGACAACGTTAAAATTAGA 
AAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTTGACAACGGGACTACCCCTACAAGAA 
GAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATTATTTAATCCTGAAATTGCAAAAACCCT 
TCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTCCAATTTTAAAATTAAAATACAATACAAACGAAGCAAAT 
AAAAAAATTTGTGAATTTATTCAAAACCAATGGAAAAAAAATTTAAATATTGATGTGGAACTTGAAAACGAAGAAT 
GGACAACATACTTAAACACTAAGGCAAATGGAAATTATGAAATAGCAAGAGCAGGATGGATAGGCGATTATGCTGA 
TCCTTTGACATTTTTAAGCATATTCACACAAGGATACACACAATTCTCATCTCATAATTACTCAAACCCAGAATAC 
AACGAACTTATAAAGAAATCCGACCTTGAGCTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCAGAAGAGA 
TAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAACAGTTACCTTTTCAGAAATGACAAATG 
GACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAAATTAAAAAATAAATAA 

t606 .nt 

TGTTGTAATAACAAGGAAAGAAAAGAAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACC 
CTCAATTAGCAGAGGATAATGTCGCATCAAAAATGATTGACACAATGTTTAGAGGGATTGTTACAGGAGATCCTAA 
TACAGGGGGAAATAAACCGGGACTTGCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTA 
AGAGAAAAAATCACTTGGAGTGACGGAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAA 
ATAAAGAAACTGGCTCAAAGTACGTTGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACA 
AGTGACTGACTCTGAACTTGGAATTAGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCT 
TATTTTATTGATATGTTAGTACACCAATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACT 
" GGACAAGCCCCGAAAACATGGTGACAAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTCTT 
TGAAAAAAATAACAAATACTACGACTCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGC 
TCAACAGCGTATAAAATGTATGAAAATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAA 
ATCTAAAATTAAGAAGCGACTATTACTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAA 
ACCACTTGACAACGTTAAAATTAGAAAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTT 
GACAACGGGACTACCCCTACAAGAAGAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATTAT 
TTAATCCTGAAATTGCAAAAACCCTTCTAGCTGAAGCTGGA'TATCCTAATGGCAATGGATTTCCAATTTTAA^ATT 
AAAATACAATACAAACGAAGCAAATAAAAAAATTTGTGAATTTATTCAAAACCAATGGAAAAAAAATTTAAATA'TT 
GATGTGGAACTTGAAAACGAAGAATGGACAACATACTTAAACACTAAGGCAAATGGAAATTATGAAATAGCAAGAG 
CAGGATGGATAGGCGATTATGCTGATCCTTTGACATTTTTAAGCATATTCACACAAGGATACACACAATTCTCATC 




TABLE 1 . Nucleotide and Amino Acid Sequences 

TC ATAATTACTCAAACC CAGAATACAACGAAC TTAT AAAGAAATCC GAC CTTGAGCTTGATCCAATAAAAAGACAA 
GACATTTTAAGACAAGCAGAAGAGATAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAACA 
GTTACCTTTTCAGAAATGACAAATGGACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAA 
ATTAAAAAATAAATAA 

f 679 .aa 

MFNRSSCVLQNFLFLFLFLSLVSCFAKKEISGMNTFIKAHSKEFDLNNLNWLWNFDYTKKNFDKHFNIDPSSYTYVA 
YLFKKIGFEEKFVEYMKKAIANGDSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQ 
KKNDKALSLLNKLDKMKFSDYQENENILLKAVLYLNLSNVSESKIYFNELFENLPANYLHVRAYDYFIIENKSRYF 
GANFLNLVRFKYEVANGNFNGAINILNKNGLNDYYDNNIVLSDVYKAFISSGKVSNALTFFSKIKSKYKNYYLGIL 
NLREKNNLGLLLLKEYLEGLDLMNEINRLDLLNTAFSNLIFTKSARDYFAESLPKFYTEGDKKNSTFIKILEEYIL 
ESIQLEDYGNLYKLYSNAQKVISNSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLD 
QNIDEFFTGGSDIKYEQSDYEIFLEGFLKFNLCNYVRGFISEDFRWGYKFSLDFYRKVYDELLKSENYYDATLVIN 
YLTOQDESALMENDYKRLYPYLYGSLIEYWAKRRGLEASWFSLIKAESSFEKNAVSKPGAVGLMQVMPSTANDIS 
KELKYFNYDLKIPKDNIIIGTYYLKKRISTTGSLYKALASYNGGIGNVRKWEKSYGHLSKELFIEAIPFSQTRNYI 
KKILVYSVFYDALYEKKGIDSVIVKIMGEFPKNZ 

t679 .aa 

CFAKKEISGmFIKAHSKEFDLNNLNWLWNFDYTKKNFDKHFNIDPSSYIYVAYLFKKIGFEEKFVEYMKKAIANG 
DSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQKKNDKALSLLNKLDKMKFSDYQE 
NENILLKAVLYLNLSNVSESKIYFNELFENLPANYLHVRAYDYFIIENKSRYFGANFLNLVRFKYEVANGNFNGAI 
NILNKNGLNDYYDNNIVLSDWKAFISSGKVSNALTFFSKIKSKYKNYYLGILNLREKNNLGLLLLKEYLEGLDLN 
NEINRLDLLNTAFSNLIFTKSARDYFAESLPKFYTEGDKKNSTFIKILEEYILESIQLEDYGNLYKLYSNAQKVIS 
NSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLDQNIDEFFTGGSDIKYEQSDYEIF 
LEGFLKFNLCISrYWGFISEDFFJ^GYKFSLDFYRKVYDELLKSENYYDATLVINYLVNQDESALMENDYKRLYPYLY 
GSLIEYWAKRRGLEASWFSLIKAESSFEKNAVSKPGAVGLMQVMPSTANDISKELKYFNYDLKIPKDNIIIGTYY 
LKKRISTTGSLYKALASYNGGIGNVRKWEKSYGHLSKELFIEAIPFSQTRNYIKKILVYSVFYDALYEKKGIDSVI 
VKIMGEFPKNZ 

f 679 .nt 

ATGTTTAATAGAAGTTCTTGTGTATTACAAAATTTTCTTTTTCTTTTTTTATTTTTAAGTTTAGTTTCTTGCTTTG 
CAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAAATTGGTT 
ATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTACATATATGTTGCT 
TATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGAGATAGCA 
TTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTGAATTGAT 
TGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTATTGGCAA 
AAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAAAATGAAA 
ATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATGAGCTTTT 
TGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAGGTATTTT 
GGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATAAATATAT 
TAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAC-GCTTTTATTAGTTC 
TGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGGTATTCTA 
AACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAACAATGAGA. 
TTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATTTTGCCGA 
AAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACTTTTATTAAGATTTTAGAAGAGTATATTTTG 
GAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCTAATTCTG 
TTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCGGAGAATA 
CAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTTATTAGAT 
CAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTTTTGGAAG 
GGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTTTAGGAATGGATATAAATTTTC 
ACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGTGATTAAT 
TATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATCCTTATTTGTATGGATCTT 
TGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTCTTTAATAAAAGCAGAGAGTAGCTT 
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• TGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGATATTTCT 
AAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCCAAAAGATAATATAATAATTGGAACATATTATTTAAAAA 
AAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTAGAAAGTG 
GGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAATTATATT 
AAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAA.TAGATTCAGTAATAGTTAAAA 
TTATGGGC GAATTC CC C AAAAATTAA 

t679.nt 

TGCTTTGCAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAA 
ATTGGTTATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTACATATA 
TGTTGCTTATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGA 
GATAGCATTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTG 
AATTGATTGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTA 
TTGGCAAAAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAA 
AATGAAAATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATG 
AGCTTTTTGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAG 
GTATTTTGGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATA 
AATATATTAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAGGCTTTTA 
TTAGTTCTGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGG 
TATTCTAAACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAAC 
AATGAGATTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATT 
TTGCCGAAAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACTTTTATTAAGATTTTAGAAGAGTA 
TATTTTGGAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCT 
AATTCTGTTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCG 
GAGAATACAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTT 
ATTAGATCAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTT 
TTGGAAGGGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTTTAGGAATGGATATA 
AATTTTCACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGT 
GATTAATTATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATCCTTATTTGTAT 
GGATCTTTGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTCTTTAATAAAAGCAGAGA 
GTAGCTTTGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGA 
TATTTCTAAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCCAAAAGATAATATAATAATTGGAACATATTAT 
TTAAAAAAAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTA 
GAAAGTGGGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAA 
TTATATTAAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAATAGATTCAGTAATA 
GTTAAAATTATGGGC GAATTC CC CAAAAATTAA 

f 11-12. nt 

TAAAAGGAGA ATATTTTTAT GAGAAAAAGT TTGTTTTTAT ATGCATTATT AATGGGAGGA 
TTGATGTCTT GTAATCTAGA TTCCAAATTA TCTAGTAACA AAGAACAAAA AAATAACAAT 
AATGTAAAAG AAGTTTCGGA TAGTGTTCAA GAAGATGGTC TTAATGATTT ATATAATAAT 
CAAGAAAAGC AAAAAAGCTT TACTAAAAAT TTTGGAGAAC GGAAAT AT GA GGATTTAATT 
AATCCTATAG AGCCTATAAT ACCTTCAGAA TCACCAAAGA ATAAGGCTAA TATACCAAAT 
ATTTCAATTG CGCATACTGA AAAAAAAGAG ACAAAAAAGG AGAATTTAAT CCCTTCTACT 
AATGAAGAAA AGGAAGCTGA TGCAGCAATT AAATATTTAG AAGAAAATAT TCTTAAAAAC 
TCTAAATTTT CTGAATTAAT TAGAGAAGTA CGTGTAATTA AAGATGAATA TGCTTTAATA 
AAAGC TGATT TGTATGATGT AATTGGAAAG ATTAACAATA AAAAAACATC ATTAATGGAG 
AATCCTAAGA ACAATAGAGA TAAGATAAAT AAATTAACAC AATTGTTGCA AAATAATTTA 
AAGATAGATA GTGAACTTGA GCAGCTTATA AATATGATTG ATATGGCAGA AAATGAAATA 
AGCTCTGCGG CTTTCTTTTT TGACAACGCT CAGAAAAGGT TAAAAGAAAG CATT ATT AAA 
AGATTAGAGA GTAAAAATAA TAGATCTTAT GCATTAAAAT TGTCTAGACA GGCTTTAAGT 
GACGCAAGAA GTGCTTTAAG TAATTTAGAA TCTTTTGCCT CTAAAAGAAT TGAACCAATG 
GTGAGAAAGG AAGAAATAAA AGAGCTTATT AAACATGCAA AAACTGTTTT AGAAAGTCTC 
AATAAAAAAT AA 
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tll-12 .nt 

TTGTAATCTAGATTCCAAATTATCTAGTAACAAAGAACAAAAAAATAACAATAATGTAAAAGAAGTTTCGGATAGT 
GTTCAAGAAGATGGTCTTAATGATTTATATAATAATCAAGAAAAGCAAAAAAGCTTTACTAAAAATTTTGGAGAAC 
GGAAATATGAGGATTTAATTAATCCTATAGAGCCTATAATACCTTCAGAATCACCAAAGAATAAGGCTAATATACC 
AAATATTTCAATTGCGCATACTGAAAAAAAAGAGACAAAAAAGGAGAATTTAATCCCTTCTACTAATGAAGAAAAG 
GAAGCTGATGCAGCAATTAAATATTTAGAAGAAAATATTCTTAAAAACTCTAAATTTTCTGAATTAATTAGAGAAG 
TACGTGTAATTAAAGATGAATATGCTTTAATAAAAGCTGATTTGTATGATGTAATTGGAAAGATTAACAATAAAAA 
AACATCATTAATGGAGAATCCTAAGAACAATAGAGATAAGATAAATAAATTAACACAATTGTTGCAAAATAATTTA 
AAGATAGATAGTGAACTTGAGCAGCTTATAAATATGATTGATATGGCAGAAAATGAAATAAGCTCTGCGGCTTTCT 
TTTTTGACAACGCTCAGAAAAGGTTAAAAGAAAGCATTATTAAAAGATTAGAGAGTAAAAATAATAGATCTTATGC 
ATTAAAATTGTCTAGACAGGCTTTAAGTGACGCAAGAAGTGCTTTAAGTAATTTAGAATCTTTTGCCTCTAAAAGA 
ATTGAACCAATGGTGAGAAAGGAAGAAATAAAAGAGCTTATTAAACATGCAAAAACTGTTTTAGAAAGTCTCAATA 
AAAAA 

fll-12 .aa 

KENIFMRKSL FLYALLMGGL MSCNLDSKLS SNKEQKNNNN VKEVSDSVQE DGLNDLYNNQ 
EKQKSFTKNF GERKYEDLIN PIEPIIPSES PKNKANIPNI S I AHTEKKET KKENLIPSTN 
EEKEADAAIK YLEENILKNS KFSELIREVR VIKDEYALIK ADLYDVIGKI NNKKTSLMEM 
PKNNRDKINK LTQLLQNNLK IDSELEQLIN MIDMAENEIS SAAFFFDNAQ KRLKESIIKR 
LESKNNRSYA LKLSRQALSD ARSALSNLES FASKRIEPMV RKEEIKELIK HAKTVLESLN 
KK 

tll-12. aa 

CNLDSKLSSNKEQKNNNNVKEVSDSVQEDGLNDLYMSIQEKQKSFTKNFGERKYE 

NISIAHTEKKETKKENLIPSTNEEKEAPAAIKYLEENILKNSKFSELIREVRVIKDEYALIKADLYDVIGKINNKK 
TSLMENPK^RDKINKLTQLLQNNLKIDSELEQLINMIDMAENEISSAAFFFDNAQKRLKESIIKRLESKNNRSYA 
LKLSRQALSDARSALSNLESFASKRIEPMVRKEEIKELIKHAKTVLESLNKK 



TAAAGGAGTT TACAAATGAG TAAACTAATA TTGGCAATAT CTATACTGCT AATAATTTCA 
TGTAAATGGT ATGTAGACAA TAC C ATT GAT GAAGCAACTG TAGAAAGTAA ' ATCAGCACTA 
ACATCTATTG ATCAAGTATT AGATGAGATA AGTGAAGCCA CAGGCCTAAG TTCGG AAAAA 
ATCACAAAAT TAACTCCGGA AGAGCTAGAA AATTTAGCAA AGGAAGC TC A AGATGACTCT 
'GAAAAATCCA AAAAAGAAAT TGAAGATCAA AAAAATAC C A AGGAAAGTAA AAACATAGAA 
GTAAAGGATA CTCCTCGCTT AATCAAATTG ATAAAGAATT CATC AG AAAA AATTGATTCG 
GTTTTTCAAA CACTAATTAA TATAGGTTAT AATGCTACCT ATGCAGCCAA AAGTAATTTG 
AAGAATGGAC TAAAGATGGT GAAATTACTG GATGAGTTGC TAAAAATATC GGTAAGTAGC 
AATGGTGATA AAAGTACCCA AAAATACAAT GAACTTAAAA CCGTTGTAAA TAAGTTTAAT 
GCTGAAAATT CGGTAAGCGT TTCTTTTAAA GAACATTCAA AC AG T AAAAT TGAAACTAAA 
AAATGTATTC AAACTCTTAT GAAAAATGTA GAAACATACT TTGAAGGTGT ATGCAGCGAA 
CTTAAAAACA AAAATGATGG TGAGTACGAA AAAACATTGA CAACTTTAAG CTAA 



ATGTAAATGGTATGTAGACAATACCATTGATGAAGCAACTGTAGAAAGTAAATCAGCACTAACATCTATTGATCAA 
GTATTAGATGAGATAAGTGAAGCCACAGGCCTAAGTTCGGAAAAAATCACAAAATTAACTCCGGAAGAGCTAGAAA 
ATTTAGCAAAGGAAGCTCAAGATGACTCTGAAAAATCCAAAAAAGAAATTGAAGATCAAAAAAATACCAAGGAAAG 
TAAAAACATAGAAGTAAAGGATACTCCTCGCTTAATCAAATTGATAAAGAATTCATCAGAAAAAATTGATTCGGTT 
TTTCAAACACTAATTAATATAGGTTATAATGCTACCTATGCAGCCAAAAGTAATTTGAAGAATGGACTAAAGATGG 
TGAAATTACTGGATGAGTTGCTAAAAATATCGGTAAGTAGCAATGGTGATAAAAGTACCCAAAAATACAATGAACT 
TAAAACCGTTGTAAATAAGTTTAATGCTGAAAATTCGGTAAGCGTTTCTTTTAAAGAACATTCAAACAGTAAAATT 



f 11-4 .nt 



tll-4.nt 
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GAAACTAAAAAATGTATTCAAACTCTTATGAAAAATGTAGAAACATACTTTGAAGGTGTATGCAGCGAACTTAAAA 
ACAAAAATGATGGTGAGTACGAAAAA 

fll-4.aa 

RSLQMSKLIL AISILLIISC KWYVDNT IDE ATVESKSALT SIDQVLDEIS EATGLSSEKI 
TKLTPEELEN LAKEAQDDSE KSKKEIEDQK NTKESKNIEV KDTPRLIKLI KNSSEKIDSV 
FQTLINIGYN ATYAAKSNLK NGLKMVKLLD ELLKISVSSN GDKSTQKYNE LKTWNKFNA 
ENSVSVSFKE HSNSKIETKK CIQTLMKNVE TYFEGVCSEL KNKNDGEYEK TLTTLS 

tll-4 .aa 

CKWYVDNTIDEATVESKSALTSIDQVLDEISEATGLSSEKITKLTPEELENLAKSAQDOSEKSKKEIEDQKNTKES 
KNIEVKDTPRLIKLIKNSSEKIDSVFQTLINIGYNATYAAKSNLKNGLKMVKLLDELLKISVSSNGDKSTQKYNEL 
KTWNKFNAENSVSVSFKEHSNSKIETKKCIQTLMKNVETYFEGVCSELKNKNDGEYEK 

fl!2-l.nt 

TGAATCTCTA AAGATTTTAG CAGGGGAGAA AATATGAAAA AAAGTTTTTT ATCAATATAC 
ATGTTAATTT CAATAAGTTT ATTATCATGT GATGTTAGTA GATTAAATCA GAGAAATATT 
AATGAGCTTA AAATTTTTGT TGAAAAGGCC AAGTATTATT C T AT AAAATT AGACGCTATT 
TATAACGAAT GTACAGGAGC ATATAATGAT ATTATGACTT ATTCGGAAGG TACATTTTCT 
GATCAAAGTA AGGTTAATCA AGCTATATCT ATATTTAAAA AAGACAATAA AATTGTTAAT 
AAGTTTAAGG AGCTTGAAAA GATTATAGAA GAATACAAAC CTATGTTTTT AAGTAAATTA 
ATTGATGATT TTGCGGGATC CGTT 

tll2-l.nt 

ATGTGATGTTAGTAGATTAAATCAGAGAAATATTAATGAGCTTAAAATTTTTGTTGAAAAGGCCAAGTATTATTCT 
ATAAAATTAGACGCTATTTATAACGAATGTACAGGAGCATATAATGATATTATGACTTATTCGGAAGGTACATTTT 
CTGATCAAAGTAAGGTTAATCAAGCTATATCTATATTTAAAAAAGACAATAAAATTGTTAATAAGTTTAAGGAGCT 
TGAAAAGATTATAGAAGAATACAAACCTATGTTTTTAAGTAAATTAATTGATGATTTT 

fll2-l.aa 

ISKDFSRGEN MKKSFLSIYM LISISLLSCD VSRLNQRNIN ELKIFVEKAK - YYSIKLDAIY 

NECTGAYNDI MTYSEGTFSD QSKVNQAISI FKKDNKIVNK FKELEKIIEE YKPMFLSKLI 
DDFAGSV 

tll2-l.aa 

CDVSRLNQRNINELKIFVEKAKYYSIKLDAIYNECTGAYNDIMTYSEGTFSDQSKVNQAISIFKKDNKIWKFfCEL 
EKIIESYKPMFLSKLIDDF 

fl4-8.nt 

TAAATACAGA GCCATTCAAG GAGAGTATTT ATGAAATACT ATATATGTGT GTGTGTTTTT 
TTGCTTTTGA ATGC TTGC AA TTCAGATTTT AGCACTAATC AAGAAGATAT TAAATATCCA 
TCTGATAAAG AGAAATCAAA ATCCAACATG GAAGCAAGCT CTAAAGAAGA AGATC CAAAT 
AAAAAAATAA AAAAT AC AC T GCTTAATGAT TTAATAAATT TGATAGAAAT AGC TAATGAG 
CATAAAGAAA AATATGAAAA AAGAATGCAA GAAGAACCTT CAGATCAATA CGGAATATTG 
GCTTTCCAGG AATTAGACTT GTCC GTTGGA AAAAT AT C TG AAGACACCCC GCAATCTAAA 
AAATTTAGAA AAAACACCTA TTCTCCCTTA AGCGCTATTG ATGTCAATAA AT T AAAAG AT 
CTTTCAGAGA TTATAAGAAA TTCGGGCCAA ATACAAGGTT TATTTAATAT TTTCAACAGA 
TTCGGAGGCA TTTTTGACGA CTCACTTAAT CACGTATATT CTAAAAAAGA TATCCTAGGG 
GGACTAGAAA TTTTGGATTT AGATAAACTA AAAAATTCGT TTGAAAAATT ACTATCTATA 



• 
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AAAGAAACTT TCTCAAAAAT GCTAAATCAA CTTTTATTAG ATTATAAAAA TGATAAAGAT 
CATATACGAA CAGAGACAAA TAAACTTAAA TCTCATACAA CTGCACTTTT CGAACAACTT 
GATAAAAAAG AAGACGAAGC ATATGAACCT AAAAATCAGA TATTTTCAAT AAGTAACCTT 
TAA 



TTGCAATTCAGATTTTAGCACTAATCAAGAAGATATTAAATATCCATCTGATAAAGAGAAATCAAAATCCAACATG 
GAAGCAAGCTCTAAAGAAGAAGATCCAAATAAAAAAATAAAAAATACACTGCTTAATGATTTAATAAATTTGATAG 
AAATAGCTAATGAGCATAAAGAAAAATATGAAAAAAGAATGCAAGAAGAACCTTCAGATCAATACGGAATATTGGC 
TTTCCAGGAATTAGACTTGTCCGTTGGAAAAATATCTGAAGACACCCCGCAATCTAAAAAATTTAGAAAAAACACC 
TATTCTCCCTTAAGCGCTATTGATGTCAATAAATTAAAAGATCTTTCAGAGATTATAAGAAATTCGGGCCAAA'TAC 
AAGGTTTATTTAATATTTTCAACAGATTCGGAGGCATTTTTGACGACTCACTTAATCACGTATATTCTAAAAAAGA 
TATCCTAGGGGGACTAGAAATTTTGGATTTAGATAAACTAAAAAATTCGTTTGAAAAATTACTATCTATAAAAGAA 
ACTTTCTCAAAAATGCTAAATCAACTTTTATTAGATTATAAAAATGATAAAGATCATATACGAACAGAGACAAATA 
AACTTAAATCTCATACAACTGCACTTTTCGAACAACTTGATAAAAAAGAAGACGAAGCATATGAACCTAAAAATCA 
G 



IQSHSRRVFM KYYICVCVFL LLNACNSDFS TNQEDIKYPS DKEKSKSNME ASSKEEDPNK 
KIKNTLLNDL INLIEIANEH KEKYEKRMQE EPSDQYGILA FQELDLSVGK ISEDTPQSKK 
FRKNTYSPLS AIDVNKLKDL SEIIRNSGQI QGLFNIFNRF ' GGIFDDSLNK VYSKKDILGG 
LETXDLDKLK NSFEKLLSIK ETFSKMLNQL LLDYKNDKDH IRTETNKLKS HTTALFSQLD 
KKEDEAYEPK NQIFSISNL 



CNSDFSTNQEDIKYPSDKEKSKSNMEASSKEEDPNKKIKNTLLNDLINLIEIANEHXEKYEKRMQEEPSDQYGILA 
FQELDLSVGKISEDTPQSKKFRKNTYSPLSAIDWKLKDLSEIIRNSGQIQGLFNIFNRFGGIFDDSLNHVYSKKD 
ILGGLEILDLDKLKNSFEKLLSIKETFSKMLNQLLLDYKjMDKDHIRTETNKLKSHTTALFEQLDKKEDEAYEPKNQ 



TAAAGGAGGG TATTTATGAA AT AC C AC ATA ATTACAACTA TATTTGTTTT TCTGTTTTTA 
GCTTC-CAGGC CGGATTTTAA TATCGATCAA AAAGACATTA AATACCCGCC TACTGAAAAA 
TCAAGGCCCA AAACTGAAAG CTCTAAGCAA AAAGAATCAA AGCCTAAAAC AGAAGAAGAG 
CTTAAGAAAA AACAACAAGA AGAAGAGCTT AAGAAAAAAC AACAAGAAGA AGAGCTTAAG 
AAAAAACAAC AAGAAGAAGA GCTTAAGAAA AAACAACAAG AAGAAGAGAA GGAAGAACTA 
AGAAAACAAC AACTAAAAAA TACGCTATCT AATGATTTAA AAAAGCAAAT AGAATCGGCC 
TACAATTTTA AAGAAAAATA TGTAAAAAGT ATGGAAAAAG AACCTGAAGA CCATTACGGG 
ATGACGTCTT TTAGGGGATT GAATTGGGGG CCAGGGACTG AAGATATATC TGACAATACC 
GAAAGATC T A TAAGATATAG AAGAC AC AC T TATACTGTTT TAAGCCCCCT GGATCCTCAT 
GAATTAAAGG AATTCGCAAA TATTATTCAA GATATAAATA AACTAGCATC AGTAGCAAGT 
ATATTTAATT CTTTTAGCGC TATTGGAGGA GCTCTTGACA TAGTAAGTGA TCACCTATAT 
TTCAAAAAAG ACAATCTAGA CAAACTAGAT ATTGCAGATT TAGAAATACT TAAAAATTCA 
TTTGAACAAA TATTATATAT AAAAGGAAGT GTTGCAGGAA AAGCAAAAAA ACTTTTATTA 
GATTATAAAA ATCTAAAAAC AGATATTAAT AAGCTTAAAT C TTATTC AAA TGAAC TGGTT 
AATGGAATTA AGCAACAAGC TCTAGAAGCA G AAAAT C TAG AAGAGCTTAT AGTGTCAAAA 
TATAAACTTT AA 



tl4-8 .nt 



f 14-8 .aa 



tl4-8 .aa 



f 17-6. nt 



t!7-6 .nt 



TTGCAGGCCGGATTTTAATATCGATCAAAAAGACATTA^A.TACCCGCCTACTGAAAAATCAJ^GGCCCAAAACTGAA 
AGCTCTAAGCAAAAAGAATCAAAGCCTAAAACAGAAGAAGAGCTTAAGAAAAAACAACAAGAAGAAGAGCTTAAGA 
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AAAAACAACAAGAAGAAGAGCTTAAGAAAA^ 

GGAAGAACTAAGAAAACAACAACTAAAAAATACGCTATCTAATGATTTAAAAAAGCAAATAGAATCGGCCTACAAT 
TTTAAAGAAAAATATGTAAAAAGTATGGAAAAAGAACCTGAAGACCATTACGGGATGACGTCTTTTAGGGGATTGA 
ATTGGGGGCCAGGGACTGAAGATATATCTGACAATACCGAAAGATCTATAAGATATAGAAGACACACTTATACTGT 
TTTAAGCCCCCTGGATCCTCATGAATTAAAGGAATTCGCAAATATTATTCAAGATATAAATAAACTAGCATCAGTA 
GCAAGTATATTTAATTCTTTTAGCGCTATTGGAGGAGCTCTTGACATAGTAAGTGATCACCTATATTTCAAAAAAG 
ACAATCTAGACAAACTAGATATTGCAGATTTAGAAATACTTAAAAATTCATTTGAACAAATATTATATATAAAAGG 
AAGTGTTGCAGGAAAAGCAAAAAAACTTTTATTAGATTATAAAAATCTAAAAACAGATATTAATAAGCTTAAATCT 

TATTCAAATGAACTGGTTAATGGAATTAAGCAACAAGCTCTAGAAGCAGAAAATCTAGAAGAGCTTATAGTGTCAA 
AATATAAACTT 

fl7-6.aa 

RRVFMKYHII TTIFVFLFLA CRPDFNIDQK DIKYPPTEKS RPKTESSKQK ESKPKTEEEL 
KKKQQEEELK KKQQEEELKK KQQEEELKKK QQEEEKEELR KQQLKNTLSN DLKXQIESAY 
NFKEKYVKSM EKEPEDHYGM TSFRGLNWGP GTEDISDNTE RSIRYRRHTY TVLSPLDPHE 
LKEFANIIQD INKLASVASI FNSFSAIGGA LDIVSDHLYF KKDNLDKLDI ADLEILKNSF 
EQILYIKGSV AGKAKKLLLD YKNLKTDINK LKSYSNELVN GIKQQALEAE NLEELIVSKY 
KL 

tl7-6 .aa 

CRPDFNIDQKDIKYPPTEKSRPKTESSKQKESKPKTEEELKKKQQEEELKKKQQEEELKKKQQEEELKKKQQEEEK 
EELRKQQLKNTLSNDLKKQIESAYNFKEKWKSMEKEPEDHYGMTSFRGLNWGPGTEDISDNTERSIRYRRHTYTV 
LSPLDPHELKEFANIIQDINKLASVASIFNSFSAIGGALDIVSDHLYFKKDNLDKLDIADLEILKNSFEQILYIKG 
SVAGKAKKLLLDYKNLKTDINKLKSYSNELVNGIKQQALEAENLEELIVSKYKL 

f!9-2 .nt 

TAAAGAAAGA TTAAATCATA TTCAAGGAGA GTATTTATGA AACACTATAT AATTGTGCAT 
ATATTTGTTT TTCTATTTTT AAATGCTTGT TATCCAGTTG CATCTAATAA AATAGAATTA 
AAACCTAAAA CAGAAACAAG CTTAAATCAA GAAGAAGTCC CAAATCAAGA AGCAAACTAC 
AAAGAAGAAA AAGAAGC AAA AGAAGAAGGC ATTAATAAAA AAACAGAAAA CACGCTGCTT 
AATGATTTAA GAAATTTAAT AGAAACAGCT AAAAAAGATA ATGATAAATA TACACAAAAG 
TTAAAAGAAG AATCCTCAAG CCAATACGGA ATACTGGCTT TCAAAGATTT GTTCTGGCTA 
GATGGAACAA ATGAACAATT GTCCGCAAAT ACCGAAAGAT CTAAAGCCTA • TAGAAAACGA 
GCTTATAGCA TCTTAAATAC TATTAATGAC GCTTCCTTAA AGAATTTTTC AGAAATTGTA 
ATGGCATCAG GACAAACACA GGGCATATTT AATACCCTTA ACTCACTTGG GGGTAATTTT 
GAAAAGATAG TTAATTGTTT GT AT C C C AAA AAAGACAATT TGGAAAAATT AGAGACTTCA 
GTTTTAAAAA AGCTTAAAGA TTCTTTGGAA AATTTTTTAG AGATAAAAAA AATCGCCTCA 
GAAATGATGC ACAAGCTCTT ATTAGACTAT CAAAATAATA CAAATCGTAT ACAAACAGAT 
AAAAATGAAC TTAAGTCTTA TGCAGACACA CTTTTCAATC AAATGACAAA AAAACC CGAA 
GAAGCACTAA AGCTAAAAAA TACCATATGC TCAATAGAGG ACCTTTAA 

tl9-2.nt 

TTGTTATCCAGTTGCATCTAATAAAATAGAATTAAAACCTAAAACAGAAACAAGCTTAAATCAAGAAGAAGTCCCA 
AATCAAGAAGCAAACTACAAAGAAGAAAAAGAAGCAAAAGAAGAAGGCATTAATAAAAAAACAGAAAACACGCTGC 
TTAATGATTTAAGAAATTTAATAGAAACAGCTAAAAAAGATAATGATAAATATACACAAAAGTTAAAAGAAGAATC 
CTCAAGCCAATACGGAATACTGGCTTTCAAAGATTTGTTCTGGCTAGATGGAACAAATGAACAATTGTCCGCAAAT 
ACCGAAAGATCTAAAGCCTATAGAAAACGAGCTTATAGCATCTTAAATACTATTAATGACGCTTCCTTAAAGAATT 
TTTCAGAAATTGTAATGGCATCAGGACAAACACAGGGCATATTTAATACCCTTAACTCACTTGGGGGTAATTTTGA 
AAAGATAGTTAATTGTTTGTATCCCAAAAAAGACAATTTGGAAAAATTAGAGACTTCAGTTTT.AAAAAAGCTTAAA 
GATTCTTTGGAAAATTTTTTAGAGATAAAAAAAATCGCCTCAGAAATGATGCACAAGCTCTTATTAGACTATCAAA 
ATAATACAAATCGTATACAAACAGATAAAAATGAACTTAAGTCTTATGCA.GACACACTTTTCAATCAAATGACAAA 
AAAACC C G AAGAAGC AC T AAAG 



189 

TABLE 1 . Nucleotide and Amino Acid Sequences 



fl9-2 .aa 

RKIKSYSRRV FMKHYIIVHI FVFLFLNACY PVASNKIELK PKTETSLNQE EVPNQEANYK 
EEKEAKEEGI NKKTENTLLN DLRNLIETAK KDNDKYTQKL KEESSSQYGI LAFKDLFWLD 
GTNEQLSANT ERSKAYRKRA YSILNTINDA SLKNFSEIVM ASGQTQGIFN TLNSLGGNFE 
KIWCLYPKK DNLEKLETSV LKKLKDSLEN FLEIKKIASE MMKKLLLDYQ NNTNRIQTBK 
NELKSYADTL FNQMTKKPEE ALKLKNTICS IEDL 

t.19-2 - aa 

CYPVASNKIELKPKTETSLNQEEVPNQEANYKEEKEAKEEGINKKTENTLLNDLRNLIETAKKDNDKYTQKLKEES 
SSQYGILAFKDLFWLDGTNEQLSANTERSKAYRKRAYSILNTINDASLKNFSEIVMASGQTQGIFNTLNSLGGNFE 
KIVNCLYPKKDNLEKLETSVLKKLKDSLENFLEIKKIASEMMHKLLLDYQNNTNRIQTDKNELKSYADTLFNQMTK 
KPEEALK 

fl9-4.nt 

TAATCTATAC TAATTGAGGA GAATATTTTT ATGAAAAACA ACATAATTTT ATGCATGTGT 
GTTTTTTTAC TTTTAAATAG CTGCACCGCT AACCATGAAG CTGAAGCGAA AATAAAAAAA 
CATGTTGATA AAACAAAAAA CGAATATATT AATGAAATAA AAAATTTAAT AGCAACAACC 
AAAGAAATCA TCGAAAAACG AAAATTGCTA CAAGCTAAAC CAGTAGATCA AAACCCCGTA 
GATGATACAA ACAATAAGAA AGTTTTCGAG ATAGATAAAA GAGCTTTCGA TTTTATAAAT 
AGTTTTTTAA CAGATGATGA ATTTAATAAA TTTGTAACAA TATTTCATAA ACCAACACTA 
AAATCACCCG GAAAAGTATT AAATAGCATA GC AATTC TAG AGCTAAACAT AGAGCAGGTA 
ATTAATCACC TAGACTCAAA AAATGAGACC TTAAATAAAG CAAGCTCTTT AG ATT TGGAA 
AAGATCAAAA ATTCCCTTGA ACAGCTGTTC TCTATAAGGA ATTTTTTTTC AACAATCATA 
AAAAGGGTCT TATTAGATCA TCAAAACAAT GAAAATTCTA TAAAACCAGA TGATTCTAAA 
. TCAGGAACCT ATTTC GAT AC GATATACGAT CAGTTTAATG AAAAAAATAA AGAGGTTAGA 
AATCTGAAAA AAACCATATT ATCACTGCCG AATTAA 

tl9-4.nt 

CTGCACCGCTAACCATGAAGCTGAAGCGAAAATAAAAAAACATGTTGATAAAACAAAAAACGAATATATTAATGAA 
AT AAAAAATTTAAT AGCAACAACC AAAGAAATCATCGAAAAACGAAAATTGCTACAAGCTAAACCAGTAGATCAAA 
ACCCCGTAGATGATACAAACAATAAGAAAGTTTTCGAGATAGATAAAAGAGCTTTCGATTTTATAAATAGTTTTTT 
AACAGATGATGAATTTAATAAATTTGTAACAATATTTCATAAACCAACACTAAAATCACCCGGAAAAGTATTAAAT 
AGCATAGCAATTCTAGAGCTAAACATAGAGCAGGTAATTAATCACCTAGACTCAAAAAATGAGACCTTAAATAAAC- 
CAAGCTCTTTAGATTTGGAAAAGATCAAAAATTCCCTTGAACAGCTGTTCTCTATAAGGAATTTTTTTTCAACAAT 
CATAAAAAGGGTCTTATTAGATCATCAAAACAATGAAAATTCTATAAAACCAGATGATTCTAAATCAGGAACCTAT 
TTCGATACGATATACGATCAGTTTAATGAAAAAAATAAAGAGGTTAC-AAATCTGAAAAAA 

fl9-4 .aa 

SILIEENIFM KNNIILCMCV FLLLNSCTAN HEAEAKIKKH VDKTKNEYIN EIKNLIATTK 

EIIEKRKLLQ AKPVBQNPVB DTNNKKVFEI DKRAFDFINS FLTDDEFNKF VTIFHKPTLK 

SPGKVLNSIA ILELNIEQVI NHLDSKNETL NKASSLDLEK IKNSLEQLFS IRNFFSTIIK 

RVLLDHQNNE NSIKPDDSKS GTYFDTIYDQ FNEKNKEVRN LKKTILSLPN 

tl9-4.aa 

CTANHEAEAKIKKHVDKTKNEYINEIKNLIATTKEIIEKRKLLQAKPVDQNPVDDTNNKKVFEIDKRAFDFINSFL 
TDDEFNKFVTIFHKPTLKSPGKVLNSIAILELxMIEQVINHLDSKNETLNKASSLDLEKIKNSLEQLFSIRNFFSTI 
IKRVIiLDHQNNENSIKPBBSKSGTYFDTIYDQFNEKNKEVRNLKK 



fl9-6.nt 
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TAAAGGAGAG TATTAATGAA ATGCCATATA ATTGCAACTA TATTTGTTTT TCTATTTTTA 
GCTTGCAGTA- CAGATTTTAA TACTGATCAA AAAGGCATTA AATACCCGCC TAC CGAAAAA 
TCAAAGC CCA AAACTGAAGA CTCTAAGCAA AAAGAATTAA AGCCTAAAAC AGAAAAAGAA 
CTAAAGAAAA AACAACAACT AAAAAATAAA CTACTTAATG ATTTAAAAAA TTCAATAGAA 
ACAGCTAATA AGCATAAAGA AAAGTATAAA AAAAGAATGA AAGAAGAACC CGAAGATCAA 
TACGGGGTAC AGGCTTTCAA AGGATCGAAT TGGGGGCCGG GGACTGAAGA TGTATCTGCC 
AACACCGAAA GATCTATAAG ATTTAGAAGA CATACTTATA CTATTTTAAG CACGCTGAGT 
CTTCATGAAT TAAAGGAATT CTCAAATATT GTTACAAATG AAAATAAACT GGTGCCAGTA 
GTAGATATGT TTAATTTCTT TAGCTCTATT GGGACAGCTC TTGATATAAC AACCGATAGC 
TTATATCCCA AAAAGACAAT C TGGAC AAAC CAGATCTGTC GGATTTAG 

tl9-6 .nt 

TTGCAGTACAGATTTTAATACTGATCAAAAAGGCATTAAATACCCGCCTACCGAAAAATCAAAGCCCAAAACTGAA 
GACTCTAAGCAAAAAGAATTAAAGCCTAAAACAGAAAAAGAACTAAAGAAAAAACAACAACTAAAAAATAAACTAC 
TTAATGATTTAAAAAATTCAATAGAAACAGCTAATAAGCATAAAGAAAAGTATAAAAAAAGAATGAAAGAAGAACC 
CGAAGATCAATACGGGGTACAGGCTTTCAAAGGATCGAATTGGGGGCCGGGGACTGAAGATGTATCTGCCAACACC 
GAAAGATCTA-TAAGA.TTTAGAAGACATACTTATACTATTTTAAGCACGCTGAGTCTTCATGAATTAAAGGAATTCT 
CAAATATTGTTACAAATGAAAATAAACTGGTGGCAGTAGTAGATATGTTTAATTTCTTTAGCTCTATTGGGACAGC 
TCTTGATATAACAACCGATAGCTTATATCCCAAAAAGACAATCTGGACAAACCAGATCTGTCGG 

fl9-6.aa 

RRVLMKCH I I ATIFVFLFLA CSTDFNTDQK GIKYPPTEKS KPKTEDSKQK ELKPKTEKEL 

KKKQQLKNKL LNDLKNSIET ANKHKEKYKK RMKEEPEDQY GVQAFKGSNW GPGTEDVSAN 

TERSIRFRRH TYTILSTLSL HELKEFSNIV TNENKLVPW DMFNFFSSIG TALDITTDSL 

YPKKTIWTNQ ICRI 

tl9-6.aa 

CSTDFNTDQKGIKYPPTEKSKPKTEDSKQKELKPKTEKELKKKQQLKNKLLNDLKNSIETANKHKEKYKKP.MKEEP 
EDQYGVQAFKGSNWGPGTEDVSANTERSIRFRRHTYTILSTLSLHELKEFSNIVTNENKLVPWDMFNFFSSIGTA 
LDITTDSLYPKKTIWTNQICR 

f21-4 .nt 

TAGGAGACAA TCTTTATGAA TAAAAAAATA AAAATGTTTA TTATTTGTGC TATTTTTATG 
CTGATAAGTT CTTGTAAGAA TGATGTAACT AGTAAAGATT TAGAAGGGGC GGTGAAAGAT 
TTAGAAAGTT CAGAACAAAA TGTAAAAAAA ACAGAACAAG AGATAAAAAA ACAAGTTGAA 
GGATTTTTAG AAATTTTAGA GACAAAAGAT TTAAACACAT TAGATACAAA AGAAATTGAA 
AAACAAATTC AAGAATTAAA GAATAAGATA GAAAAATTAG ACTCTAAAAA AACTTCTATT 
GAAACATATT CTGGGTATGA AGAAAAAATA AACAAAATAA AAGAAAAATT AAACGGAAAA 
GGACTTGAAG ATAAATTAAA TGAACTTTCA GAGAGCTTAA AAAAGAAAAA AGAGGAGAGA 
AAAAAAGCTT TACAAGAGGC TAAAAAGAAA TTTGAAGAGT ATAAAAACCA AGCTGAATCT 
GCAACTGGAG TAACGCATGG TTCTCAAGTC CAAAGACAAG GTGGTGTTGG ATTACAAGCT 
TGGCAGTGTG CTAATAGTTT GGGGTTTAAA AATATGACTA GTGGTAATAA- TAC TAGC GAT 
ATGACCAATG AAGTTATAAC TAATTCGCTT AAAAAGATTG AAGAAGAACT TAAAAATATT 
GGAGAAACTG TAGAAGGTAA AAAAGAATAA 

t21-4 .nt 

TTGTAAGAATGATGTAACTAGTAAAGATTTAGAAGGGGCGGTGAAAGATTTAGAAAGTTCAGAACAAAATGTAAAA 
AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTTTAGAGACAAAAGATTTAAACACATTAG 
ATACAAAAGAAATTGAAAAJVCAAATTCAAGAATTAAAGAATAAGATAGAAAAATTAGACTCTAAAAAAACTTCTAT 
TGAAACATATTCTGGGTATGAAGAAAAAATAAACAAAATAAAAGAAAAJ^TTAAACGGAAAAGGACTTGAAGATAAA 



• 
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TTAftATGAACTTTCAGAGAGCTTAAAAAAGAAAAAAGAGGAGAGAAAAAAAGCTTTACAAGAGGCTAAAAAGAAAT 
TTGAAGAGTATAAAAACCAAGCTGAATCTGCAACTGGAGTAACGCATGGTTCTCAAGTCCAAAGACAAGGTGGTGT 
TGGATTACAAGCTTGGCAGTGTGCTAATAGTTTGGGGTTTAAAAATATGACTAGTGGTAATAATACTAGCGATATG 
ACCAATGAAGTTATAACTAATTCGCTTAAAAAGATTGAAGAAGAACTTAAAAATATTGGAGAAACTGTAGAAGGTA 
AAAAAGAA 



CKNDVTSKDLEGA^/KDLESSEQNVKKTEQEIKKQVEGFLSILETKDLNTLDTKEIEKQIQELiCNKIEKLDSKKTSI 
Wi ETYSGYEEKINKIKEKLNGKGLEDKLNELSESLKKKKEERKKALQEAKKKFEEYKNQAESATGVTHGSQVQRQGGV 
GLQAWQCANSLGFKNMTSGNNTSDMTNEVITNSLKKIEEELKNIGETVEGKKE 

Uj f24-l.nt 

\.l TAAGCTGGTA ACACTGTAAA GACAGCTGAG GGGGCTTCAA GTGGTACTGA TGCAATTGGA 
iij GAAGTTGTGG ATAATGATGC TAAGGTTGCT GATAAGGCGA GTGTGACGGG GATTGCTAAG 
□ GGGATAAAGG AGATTGTTGA AGCTGCTAGG GGGAGTGAAA AGCTGAAAGT TGCTGCTGCT 
s; AAAGAGGGCA ATGAAAAGGC AGGGAAGTTG TTTGGGAAGG CTGGTGCTAA TGCTCATGGG 

R!l GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
... TTAAGTGCGA TTGTTAAGGC TGCGGATGCG GCTGAGCAGG ATGGAAAGAA GCCTGCAGAT 
fjjj GCTACAAATC CGATTGCTGC TGCTATTGGG AATAAAGATG AGGATGCGGA TTTTGGTGAT 
n,j GGGATGAAGA AGGATGATCA GATTGCTGCT GCTATTGCTT TGAGGGGGAT GGCTAAGGAT 
m GGAAAGTTTG CTGTGAAGAA TGATGAGAAA GGGAAGGCTG AGGGGGC TAT TAAGGGAGCT 
'H GCTGCAATTG GAGAAGTTGT GGATAATGCT GGTGCTGCGA AGGCTGCTGA TAAGGATAGT 
GTGAAGGGGA TTGCTAAGGG GATAAAGGAG ATTGTTGAAG CTGCTGGGGG GAGTGAAAAG 
CTGAAAGCTG CTGCTGCTGA AGGGGAGAAT AATAAAAAGG CAGGGAAGTT GTTTGGGAAA 
GTTGATGGTG CTGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGAGCAGGAT 
GGAGAGAAGC CTGAGGATGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA - GGGTAATGGG 
GATGGTGCGG AGTTTGATCA GGATGAGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGGTAATAA TGAGAAAGAG 
AAGGC TGAGG GGGCTATTAA AGAAGTTAGC GAGTTGTTGG ATAAGCTGGT AACAGCTGTA 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GATGCAATTG GAGAAGTTGT GGATAATGNT 
GCNAAGGNTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTN GGGGGAGTGA AAAGC TGAAA GTTGCTGCTG CTANAGNGGN NAATAATAAA 
GAGGCAGGGA AGTTGTTTGG GAAGGCTGGT GCTGATGCTA ATGGGGACAG TGAGGCTGCT 
AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT AGTGGGGAGC AGATATTAAG TGCGATTGTT 
AAGGCTGCGG CTGCTGGTGC GGC TGATC AG GATGGAGAGA AGCCTGGGGA TGCTAAAAAT 
CCGATTGCTG CTGCTATTGG GAAGGGTAAT GCGGATGATG GTGCGGATTT TGGTGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAAGGA TGAGAAAGGG AAGGCTGAGG GGGCTATTAA GGGAGCTAGC 
GAGTTGTTGG ATAAGCTGGT AAAAGC TGT A AAGACAGCTG AGGGGGCTTC AAGTGGTACT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GCGAAGGCTG CTGATAAGGA TAGTGTGACG 
GGGATTGCTA AGGGGATAAA GGAGATTGTT GAAGC TGC AG GGGGGAGTGA AAAGCTGAAA 
GTTGCTGCTG CTAAAGGGGA GAATAATAAA GGGGCAGGGA AGTTGTTTGG GAAGGCTGGT 
GCTAATGCTC ATGGGGACAG TGAGGCTGCT AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT 
AGTGGGGAAC AGATATTAAG TGCGATTGTT AAGGCTGCTG GTGAGGCTGC TGGTGATCAG 
GAGGGAAAGA AGCCTGAGGA GGCTAAAAAT CCGATTGCTG CTGCTATTGG GGATAAAGAT 
GGGGATGCGG AGTTTAATCA GGATGGGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 



f21-4.aa 



ETIFMNKKIK 
FLEILETKDL 
LEDKLNELSE 
QCANSLGFKN 



MFIICAIFML ISSCKNDVTS KDLEGAVKDL ESSEQNVKKT EQEIKKQVEG 
NTLDTKEIEK QIQELKNKIE KLDSKKTSIE TYSGYEEKIN KIKEKLNGKG 
SLKKKKEERK KALQEAKKKF EEYKNQA2SA TGVTHGSQVQ RQGGVGLQAW 
MTSGNNTSDM TNEVITNSLK KIEEELKNIG ETVEGKKE 



t21-4 ,aa 
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GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGATGGTGG TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGGAGTTAGC GAGTTGTTGG ATAAGCTGGT AAAAGC TGT A 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GCTGCAATTG GAGAAGTTGT GGCTGATGCT 
GCTAAGGTTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTC-CTG GGGACAGTGA GGCTGCTAGC AAGGCAGCTG GTGCTGTTAG TGCTGTTAGT 
GGGGAGCAGA TATTAAGTGC GATTGTTAAG GCTGCGGCTG CTGGTGCGGC TGAGCAGGAT 
GGAGAGAAGC CTGCAGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTGATGGG 
GATGCGGATT TTGGTGAGGA TGGGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGCTGCAATT GGAGAAGTTG TGGATAATGC TGGTGCTGCG 
AAGGCTGCTG ATAAGGATAG TGTGAAGGGG ATTGCTAAGG GGATAAAGGA GATTGTTGAA 
GCTGCTGGGG GGAGTGAAAA GCTGAAAGCT GCTGCTGCTG AAGGGGAGAA TAATAAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGGT GCTGCTGGGG ACAGTGAGGC TGCTAGCAAG 
GCGGCTGGTG CTGTTAGTGC TGTTAGTGGG GAGCAGATAT TAAGTGCGAT TGTTAAGGCT 
GCGGATGCGG CTGAGCAGGA TGGAAAGAAG CCTGCAGATG CTACAAATCC GATTGCTGCT 
GCTATTGGGA ATAAAGATGA GGATGCGGAT TTTGGTGATG GGATGAAGAA GGATGATCAG 
ATTGCTGCTG CTATTGCTTT GAGGGGGATG GCTAAGGATG GAAAGTTTGC TGTGAAGGGT 
AATAATGAGA AAGGGAAGGC TGAGGGGGCT TCAAGTGGTA CTGATGCAAT TGGAGAAGTT 
GTGGATAATG ATGC GAAGGC TGCTGATAAG GCGAGTGTGA CGGGGATTGC TAAGGGGATA 
AAGGAGATTG TTGAAGCTGC TGGGGGGAGT GAAAAGCTGA AAGCTGTTGC TGCTGCTACA 
AGGGAGAATA ATAAAGAGGC AGGGAAGTTG TTTGGGAAAG TTGATGATGC TCATGC TGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
TTAAGTGCGA TTGTTACGGC TGCGGCTGCT GGTGAGCAGG AT GGAGAGAA GCCTGCAGAG 
GCTACAAATC CGATTGCTGC TGCTATTGGG AAGGGTAATG AGGATGGTGC GGATTTTGGT 
AAG GAT GAGA TGAAGAAGGA TGATCAGATT GCTGCTGCTA TTGCTTTGAG GGGGATGGCT 
AAGGATGGAA AGTTTGCTGT GAAGAGTAAT GATGGTGAGA AAGGGAAGGC TGAGGGGGCT 
ATTAAGGAAG TTAGCGAGTT GTTGGATAAG CTGGTAAAAG CTGTAAAGAC AGC TGAGGGG 
GCTTCAAGCG GTACTGATGC AATTGGAGAA GTTGTGGCTA ATGCTGGTGC TGCGAAGGCT 
GCTGATAAGG CGAGTGTGAC GGGGATTGCT AAGGGGATAA AGGAGATTGT TGAAGCTGCT 
GGGGGGAGTA AAAAGCTGAA AGCTGCTGCT GCTGAAGGGG AGAATAATAA AAAGGCAGGG 
AAGTTGTTTG GGAAGGCTGG TGCTGGTGCT GGTGCTAATG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGGTTAG 

t2 4-1. nt 

TGGTGAGGCTGAGCAGGATGGAGAGAAGCCTGAGGATGCTAAAAATCCGATTGCTGCTGCTATTGGGAAGGGTAAT 
GGGGATGGTGCGGAGTTTGATCAGGATGAGATGAAGAAGGATGATCAGATTGCTGCTGCTATTGCTTTGAGGGGGA 
TGGCTAAGGATGGAAAGTTTGCTGTGAAGGGTAATAATGAGAAAGAGAAGGCTGAGGGGGCTATTAAAGAAGTTAG 
CGAGTTGTTGGATAAGCTGGTAACAGCTGTAAAGACAGCTGAGGGGGCTTCAAGTGGTACTGATGCAATTGGAGAA 
GTTGTGGATAATGNTGCNAAGGNTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGGATAAAGGAGATTGTTG' 
AAGCTGCTNGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTANAGNGGNNAATAATAAAGAGGCAGGGAAGTTGTT 
TGGGAAGGCTGGTGCTGATGCTAATGGGGACAGTGAGGCTGCTAGCAAG 

f24-l .aa 

AGNTVKTAEG ASSGTDAIGE WDNDAKVAD KASVTGIAKG IKEIVEAARG SEKLKVAAAK 
EGNEKAGKLF GKAGANAHGD S EAASKAAGA VSAVSGEQIL SAIVKAADAA EQDGKKPADA 
TNPIAAAIGN KDEDADFGDG MKKDDQ I AAA IALRGMAKDG KFAVKNDEKG KAEGAIKGAA 
AIGEWDNAG AAKAADKDSV KGIAKGIKEI VEAAGGSEKL KAAAAEGENN KKAGKLFGKV 
DGAAGDSEAA SKAAGAVSAV SGEQIL3AIV KAAGEAEQDG EKPEDAKNPI AAA I GKGNGD 
GAEFDQDEMK KDDQIAAAIA LRGMAKDGKF AVKGNNEKEK AEGAIKEVSE LLDKLVTAVK 
"TAEGASSGTD AIGEWDNXA KXADKASVTG IAKGIKEIVS AAXGSEKLKV AAAXXXNNKE 
AGKLFGKAGA DANGDSEAAS KAAGAVSAVS GEQILSAIVK AAAAGAADQD GEKPGDAKN? 
I AAA I GKGNA DDGADFGDGM KKDDQIAAAI ALRGMAKDGK FAVKKDEKGK AEGAIKGASE 
LLDKLVKAVK TAEGAS SGTA AIGEWDNAA KAADKDSVTG IAKGIKEIVS AAGGSEKLKV 
AAAKGSNNKG AGKLFGKAGA NAHGDSEAAS KAAGAVSAVS GEQILSAIVK AAGEAAGDQE 
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GKKPEEAKNP IAAAIGDKDG DAEFNQDGMK KDDQ IAAAI A LRGMAXDGKF AVKDGGEKEK 
AEGAIKGVSE LLDKLVKAVK TAEGASSGTA AIGEWADAA KVADKASVTG IAKGIKEIVE 
AAGDSEAASK AAGAVSAVSG EQILSAIVKA AAAGAAEQDG EKPAEAKNPI AAAIGKGDGD 
ADFGEDGMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE GAIKGAAAIG EWDNAGAAK 
AADKDSVKGI AKGIKEIVEA AGGSEKLKAA AAEGENNKKA GKLFGKVDGA AGDSEAASKA 
AGAVSAVSGE QILSAIVKAA DAAEQDGKKP ADATNPIAAA IGNXDEDADF GDGMKKDDQI 
AAAIALRGMA KDGKFAVKGN NEKGKAEGAS SGTDAIGEW DNDAKAADKA SVTGIAKGIK 
EIVEAAGGSE KLKAVAAATR ENNKEAGKLF GKVDDAHAGD SEAASKAAGA VSAVSGEQIL 
SAIVTAAAAG EQDGEKPAEA TNP IAAAI GK GNEDGADFGK DEMKKDDQ I A AAIALRGMAK 
DGKFAVKSND GEKGKAEGAI KEVSELLDKL VKAVKTAEGA SSGTDAIGEV VANAGAAKAA 
DKASVTGIAK GIKEIVEAAG GSKKLKAAAA EGENNKKAGK LFGKAGAGAG ANGDSEAASK 
AAGAVSAG 

t24-l .aa 

GEAEQDGEKPEDAKNPIAAAIGKGNGDGAEFDQDSMKKDDQIAAAIALRGMAKDGKFAVKGNNEKEKAEGAIKEVS 
ELLDKLVTAVKTAEGASSGTDAIGEWDNXAKXADKASVTGIAKGIKEIVEAAXGSEKLKVAAAXXXNNKEAGKLF 
GKAGADANGDS EAASK 

f2 8-2 .nt 

TAAAAAGGAA ATATAAATAT TATGCGATTA TGTTTAATAA AAATTTTTAT TAT AC CTAAT 
TTAGTATTTA GTTCTCTTTT TTTATTTGAA AGTTGTTCTG GTTTTCTATC TAAAAAATCT 
ATAGAACAGT TTGCATTAGC ATTAAAAGAT CATCAAGAAA ATAAAAATAC TACTAATACT 
TCAGTAGATA AAAATAGTAA GGAAAT TGAA TCTCCTAAAG ACGTTACATC ATCAAATAAA 
AAAAC TTATG ATCCAATCTT ACAAGTAGGT TCTAATCAAC ATATGTCAGA TGATCCTGGT 
GCTAATAATA AAGAATCCCT ACCAAATTCA AGTCCAGCAA TAATACAAAA TGAC TCGCAT 
GCTCAAAATA ATGTAAAGAT GGAAGAAAAT AAATCAGCTA CTCCACAACA TGATCCAATT 
GAACAAAGTA ATTTTAAAAA TAGCCTTACT ACAACAAGTA AAACTCCTGC TATTCCTTCA 
GAAGAAGAAA TTAAAGCTAA CTTAGATGAA TTTGCACAAG AAGAGTATGA GCAAACATCT 
CTTTCAGAAA TTAAAAATGC CACGCAAATT GTTAATCATG CTAATCCTGA AAACAAATTA 
AACAATACAC TCCTTGAGTT TGAAAAAGAT TATGAAACTT TATCAAACTT GTTATTCTCT 
AATTTAGACG CATCTCCTTT GAATAGAAAA ATAAAGACTA TTATGCCTAA ATTACAAGAA 
ATGCGTTCTT TTATGGAGCA AGCAACTAAT TCTTGGGTAT CTGCTAAAGG CATGCTAGAT 
GAGGCTAAGG ATAAACTAGC AGAATCTATT TATAAAAGAC TATACAATGG CAATTCATAC 
CGGTTCGGTG GCAGTTTTAA CGGACGTGAT ATGCAACATG CAAAAAATTT -AGCATACAGA 
GCTATAGACT TTGCTTCTGC ATGCATTGAA TATACACAAA AAGCTATTGA TTATCTTCAA 
CAGGGAAATT CTTGCAAAAA AGAAATAGAA AATATATTCA AGCTTTAA 

t28-2 .nt 

AAAAGATCATCAAGAAAATAAAAATACTACTAATACTTCAGTAGATAAAAATAGTAAGGAAATTGAATCTCCTAAA 
GACGTTACATCATCAAATAAAAAAACTTATGATCCAATCTTACAAGTAGGTTCTAATCAACATATGTCAGATGATC 
CTGGTGCTAATAATAAAGAATCCCTACCAAATTCAAGTCCAGCAATAATACAAAATGACTCGCATGCTCAAAATAA 
TGTAAAGATGGAAGAAAATAAATCAGCTACTCCACAACATGATCCAATTGAACAAAGTAATTTTAAAAATAGCCTT 
ACTACAACAAGTAAAACTCCTGCTATTCCTTCAGAAGAAGAAATTAAAGCTAACTTAGATGAATTTGCACAAGAAG 
AGTATGAGCAAACATCTCTTTCAGAAATTAAAAATGCCACGCAAATTGTTAATCATGCTAATCCTGAAAACAAATT 
AAACAATACACTCCTTGAGTTTGAAAAAGATTATGAAACTTTATCAAACTTGTTATTCTCTAATTTAGACGCATCT 
CCTTTGAATAGAAAAATAAAGACTATTATGCCTAAATTACAAGAAATGCGTTCTTTTATGGAGCAAGCAACTAATT 
CTTGGGTATCTGCTAAAGGCATGCTAGATGAGGCTAAGGATAAACTAGCAGAATCTATTTATAAAAGACTATACAA 
TGGCAATTCATACCGGTTCGGTGGCAGTTTTAACGGACGTGATATGCAACATGCAAAAAATTTAGCA'TACAGAGCT 
ATAGACTTTGCTTCTGCATGCATTGAATATACACAAAAAGCTATTGATTATCTTCAACAGGGAAATTCTTGCAAAA 
AAGAAATAGAAAATATATTCAAG 



f28-2 .aa 
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KGNINIMRLC LIKIFIIPNL VFSSLFLFES CSGFLSKKSI EQFALALKDH QENKNTTNTS 
VDKNSKEIES PKDVTSSNKK TYDPILQVGS NQHMSDDPGA NNKESLPNSS PAIIQNDSHA 
QNNVKMEENK SATPQHDPIE QSNFKNSLTT TSKTPAIPSE EEIKANLDEF AQEEYEQTSL 
SEIKNATQIV NHANPENKLN NTLLEFEKDY ETLSNLLFSN LDASPLNRKI KTIMPKLQEM 
RSFMEQATNS WVSAKGMLDE AKDKLAESIY KRLYNGNSYR FGGSFNGRDM QHAKNLAYRA 
IDFASACIEY TQKAIDYLQQ GNSCKKEIEN IFKL 

t28-2 .aa 

kdhqenknttntsvdknskeiespkw^ 

vkmeenksatpqhdpieqsnfknsltttsktpaipseeeikanldefaqeeyeqtslseiknatqivnhanpenkl 
nntllefekdyetlsnllf snldas plnrk ikt imp klq emrs fmeq atnswvs akgmldeakdklaes i ykrlyn 
gnsyrfggsfngrdmqhaknlayraidfasacieytqkaidylqqgnsckkeienifk 

f28-3 .nt 

TAGATGAATT TAATTGCTAA ATTATTTATT TTATCCACTT TAGTTTCAAT TCCAAATATC 
CTCTCTTGTA ACCTATATGA TAATCTTGCA GACAACGCTG AGCAGGTTAC AGACATACTA 
GACAACAACA AGTCTTTTAA TACTTTAGGA AGCAGCAATG AGAGTAGAAG TCGCAGGCCT 
AGAAGTACAA ATAATGCTTA TATGAAACAA AACATAGACA AAAATCATTT AGTTGTTGCA 
GATATGCAAA ATGATAATAG TAGCAGCAGT CTTCCCCAAC AAGTTAATAG TGAATCCAGT 
AAAGCTAATG AAGATAGTAA TATT ATGAAG GAAATTGAAT CTTCTACAGA AGAGTGCGCT 
AGACTAAGAA AAGATTTAGA AACTATAAAA CAAATACTTG ATAATATAGA AAGCTTGCTT 
AATACAGCTA ATTCTTATTT AGAGAACGCT AGAAAAGCAC CTAAATCTAA TCAAGATAAT 
CAAACCTTAT TGCTTAGCCT GCACCAAGCT ATTGCTAAGG TTAAGAGTAG TCATACTTCT 
TTTATCATTT GTTATAATGA TGCATTTAAT TCCCTGGGAA TAGCTGATAC TGC C TTT AAA 
GATGCAAAGA GAAAGGCAGT TGAGGCATAA 

t28-3 .nt 

TTGTAACCTATATGATAATCTTGCAGACAACGCTGAGCAGGTTACAGACATACTAGACAACAACAAGTCTTTTAAT 
ACTTTAGGAAGCAGCAATGAGAGTAGAAGTCGCAGGCCTAGAAGTACAAATAATGCTTATATGAAACAAAACATAG 
ACAAAAATCATTTAGTTGTTGCAGATATGCAAAATGATAATAGTAGCAGCAGTCTTCCCCAACAAGTTAATAGTGA 
ATCCAGTAAAGCTAATGAAGATAGTAATATTATGAAGGAAATTGAATCTTCTACAGAAGAGTGCGCTAGACTAAGA 
AAAGATTTA.GAAACTATAAAACAAATACTTGATAATATAGAAAGCTTGCTTAATACAGCTAATTCTTATTTAGAGA 
ACGCTAGAAAAGCACCTAAATCTAATCAAGATAATCAAACCTTATTGCTTAGCCTGCACCAAGCTATTGCTAAGGT 
TAAGAGTAGTCATACTTCTTTTATCATTTGTTATAATGATGCATTTAATTCCCTGGGAATAGCTGATACTGCCTTT 
AAAGATGCAAAGAGAAAGGCAGTTGAGGCA 

f2 8-3 .aa 

MNLIAKLFIL STLVSIPNIL SCNLYDNLAD NAEQVTDILD NNKSFNTLGS SNESRSRRPR 
STNNAYMKQN I DKNHL WAD MQNDNSSSSL PQQVNSESSK ANEDSNIMKE IESSTEECAR 
LRKDLETTKQ ILDNIESLLN TANSYLENAR KAPKSNQDNQ TLLLSLHQAI AKVKSSHTSF 
IICYNDAFNS LGIADTAFKD AKRKAVEA 

t28-3.aa 

CNL YDNLADNAEQ VTD I LDNNKS FNT LGS SNE SRS RRPRS TNNAYMKQNIDKNHL WADMQNDNS S SSL PQQVNS E 
SSIKANEDSNIMKEIESSTESCARLRKDLETIKQILDNIESLLNTANSYLiENARKAPKSNQDNQTLLLSLHQAIAKV 
KSSHTSFII C YMDAFWS LGIADTAFKD AKRKAVEA 



TAAAAAAATA AGGAGGTATT AATGAAAAGG AAAAGCAATA TATGTATTTC ACTTCTAGTC 
ACAATATTAT TTGTGTCTTG CAAGTTTTTT GGAAATAAAA GCGCAAGTAA AGAAAAAGAA 
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GAAACTTCTT TTTCTGATAC TGCTAGCAAG ATTAGTAAGT CGGGAACAGC TGCTTCTTCA 

GACAAACAAG AAAAAAATAC AAGTGATGTT ACAGGTGACG CCAAAAAGCA TACTAGTAGC 

CCTTACATGC TTGCTGATGC CCTTATTGTT AGTGATACTA CTAATAGAGA TAGAGATAAG 

CAAGAAAATA AAGATAAATT AAATGAAGAA GATAAAAAAA AGCTTAATGC TTTTTTTAGC 

ACAACTAAAA CATATCAATC TAGC C TAG AT TCCATTTATA ACAAATATAC AGGCTATTAT 

AATACCATTG ATACCTATGG CAGCTGTGAT ACGTATCGCA TTGAGTGTTT TAGTGTAGGA 

CCTTCTGAAA AACGTAAACA AGCTCTTGCT GATCTAGAGA AGTTAAAACT AGACGAAAAG 

TACACTCAGC TTAGCACAAT GTTAAAGAGT GCTGTGCCTA GTTATTACAA AAAAAATTTA 

GATGATTCTA TTGCACAGTA TAAGGAAGCC ATAAAGCAGG CTATTGAAGC TGAAAGTAAA 

ATAGAGACAG TAAAAGACTA TGCAACAGCT CAAAGTGCTG CCGATGACGA AAAGAAAAGA 

AATATAGATA ATTTAAAAAT AGTTAGAGAT GTTCTTCTTA TTATTAAAAA AACTATTGAG 

AAAGCCAGCC GATCTTATGC TGATGCTTTT GCTATTGCAA CATCTAGCTT ATCTTGTAGC 

GAATTTAAGC AAGCTGTTAA AGAGTTTAAT GATGCTGCTA AACAATATGC TAATGGAAAT 

AAAGGAGACA ATGCTGTCAA TGTTATTGTA GGCACTATTT CTAGTATGCC TTATGTCAAA 

TTTAAAGATG AGTTTGCAAG AGCAAAAATG TTTGCTCGTA AT T AT AG AGG AGACGAGGTA 

GACAAGATGA TAAGAGCTAT CGACAAGCTG TGTGATGTTT ATAAAAAAGT TGCGCTTTAG 

t31-2 -nt 

TTGCAAGTTTTTTGGAAATAAAAGCGCAAGTAAAGAAAAAGAAGAAACTTCTTTTTCTGATACTGCTAGCAAGATT 
AGTAAGTCGGGAACAGCTGCTTCTTCAGACAAACAAGAAAAAAATACAAGTGATGTTACAGGTGACGCCAAAAAGC 
ATACTAGTAGCCCTTACATGCTTGCTGATGCCCTTATTGTTAGTGATACTACTAATAGAGATAGAGATAAGCAAGA 
AAATAAAGATAAATTAAATGAAGAAGATAAAAAAAAGCTTAATGCTTTTTTTAGCACAACTAAAACATATCAATCT 
AGCCTAGATTCCATTTATAACAAATATACAGGCTATTATAATACCATTGATACCTATGGCAGCTGTGATACGTATC 
GCATTGAGTGTTTTAGTGTAGGACCTTCTGAAAAACGTAAACAAGCTCTTGCTGATCTAGAGAAGTTAAAACTAGA 
CGAAAAGTACACTCAGCTTAGCACAATGTTAAAGAGTGCTGTGCCTAGTTATTACAAAAAAAATTTAGATGATTCT 
ATTGCACAGTATAAGGAAGCCATAAAGCAGGCTATTGAAGCTGAAAGTAAAATAGAGACAGTAAAAGACTATGCAA 
CAGCTCAAAGTGCTGCCGATGACGAAAAGAAAAGAAATATAGATAATTTAAAAATAGTTAGAGATGTTCTTCTTAT 
TATTAAAAAAACTATTGAGAAAGCCAGCCGATCTTATGCTGATGCTTTTGCTATTGCAACATCTAGCTTATCTTGT 
AGCGAATTTAAGCAAGCTGTTAAAGAGTTTAATGATGCTGCTAAACAATATGCTAATGGAAATAAAGGAGACAATG 
CTGTCAATGTTATTGTAGGCACTATTTCTAGTATGCCTTATGTCAAATTTAAAGATGAGTTTGCAAGAGCAAAAAT 
GTTTGCTCGTAATTATAGAGGAGACGAGGTAGACAAGATGATAAGAGCTATCGACAAG 

f31-2.aa 

KNKEVLMKRK SNICISLLVT ILFVSCKFFG NKSASKEKEE TSFSDTASKI SKSGTAASSD 

KQEKNTSDVT GDAKKHTSSP YMLADALIVS DTTNRDRDKQ ENKDKLNEED KKKLNAFFST 

TKTYQSSLDS IYNKYTGYYN TIDTYGSCDT YRIECFSVGP SEKRKQALAD LEKLKLDEKY 

TQLSTMLKSA VPSYYKKNLD DS I AQYKEAI KQAIEAESKI ETVKDYATAQ SAADDEKKRN 
IDNLKIVRDV LLIIKKTIEK ASRSYADAFA IATSSLSCSE FKQAVKEFND "AAKQYANGNK 

GDNAVNVIVG TISSMPYVKF KDEFARAKMF ARNYRGDEVD KMIRAIDKLC DVYKKVAL 

t31-2.aa 

CKFFGNKSASKEKSETSFSDTASKISKSGTAASSDKQEKNTSDVTGDAKKHTSSPYMLADALIVSDTTNRDRDKQE 
NKDKLNEEDKKKLNAFFSTTKTYQ'SSLDSIYNKYTGYYNTIDTYGSCDTYRIECFSVGPSEKRKQALADLEKLKLD 
EKYTQLSTMLKSAVPSYYKKNLDDSIAQYKEAIKQAIEAESKIETVKDYATAQSAADDEKKRNIDNLKIVRDVLLI 
IKKTIEKASRSYADAFAIATSSLSCSEFKQAVKEFNDAAKQYANGNKGDNAVNVIVGTISSMPWKFKDEFARAKM 
FARNYRGDEVDKMIRAIDK 

f32-4.nt 

TAAGGAAATA TGAGGAATAT TAGCAATTGT ATCAAATATA TTATATTAAC AATGCTTATT 

GGATTATTAA TTTTTTGTTG TGCAACCTTT GTTTGGTTGA TTGGAATTTT TTATTCAAAT 

AACTTTAAAG AAGAGCGGAA TTATTCAATA AGCCCAATAG ATAGTGTTAT TATGCGTAAA 

TGTTATTTTA AAGAATTTAA GTCTGGACTT ATTAAAAGCG TATTCTTTAA GAAATTAGAT 
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GTAAATGTTA ACTCTAAAAA TTTTAAGGAG CTAAATAAGG 

AATTCTTATC CATCTTATCA TATGGAGTTT GTCGTAGTTG 

TTTAAAAATG TTATTTTTAA TGGTATAGAT GATGCTAAAT 

GTTTACGGAG GATTTAGATA CTCAAAAGAG GCTTATTTCC 

GTTAAATTAA ATAAAATGAA ACAATATACT CCAGCAATTG 

AACATTAATG ATGCTTTATT TAACTCGTTA TTAAAGCAAA 

ATTTCCCATA ATAATAAAGA GTATATTTTA CAAACTAATA 

TTTCAAACAC CAGAAAAGGA GAATAGTTCT TACTAA 

t32-4.nt 

AAATAACTTTAAAGAAGAGCGGAATTATTCAATAAGCCCAATAGATAGTGTTATTATGCGTAAATGTTATTTTAAA 
GAATTTAAGTCTGGACTTATTAAAAGCGTATTCTTTAAGAAATTAGATGTAAATGTTAACTCTAAAAATTTTAAGG 
AGCTAAATAAGGTAGATAAACAAAATCTGCTAAATTCTTATCCATCTTATCATATGGAGTTTGTCGTAGTTGATAA 
TGGATTTTTAATGAATTTTAAAAATGTTATTTTTAATGGTATAGATGATGCTAAATTATACGATCAACGTGATATG 
GTTTACGGAGGATTTAGATACTCAAAAGAGGCTTATTTCCAAATTATTGGCAATTATGATGTTAAATTAAATAAAA 
TGAAACAATATACTCCAGCAATTGTAGTAAATGTTTTCAAAATTAACATTAATGATGCTTTATTTAACTCGTTATT 
AAAGCAAAAAACTTTAAAAGTTACTTTGATTTCCCATAATAATAAAGAGTATATTTTACAAACTAATAATTTCTTA 
TCAAAGTATAATTTTCAAACACCAGAAAAGGAGAATAGTTCTTAC 

f32-4.aa 

GNMRNISNCI KYIILTMLIG LLIFCCATFV WLIGIFYSNN FKEEPdMYSIS PIDSVIMRKC 

YFKEFKSGLI KSVFFKKLDV NVNSKNFKEL NKVDKQNLLN SYPSYHMEFV WDNGFLMNF 

KNVIFNGIDD AKLYDQRDMV YGGFRYSKEA YFQIIGNYDV KLNKMKQYTP AIWNVFXIN 

INDALFNSLL KQKTLKVTLI SHNNKEYILQ TNNFLSKYNF QTPEKENSSY 

t32-4.aa 

NNFKEERNYSISPIDSVIMRKCYFKEFKSGLIKSVFFKKLDVlSn/NSKNFKELNKVDKQNLLNSYPSYHMEFVWDN 
GFLMNFKIWIFNGIDDAKLYDQRDIWYGGFRYSKEAYFQIIGNYDVKLNKMKQYTPAIVVNVFKININDALFNSLL 
KQKTLKVTLI SHNNKEYI LQTNNFLS KYNFQTPEKENSS Y 

f4-15 -nt 

TAAATGAGCA AAAAAGTAAT TTTAATATTA CTAGAAATTT 
TCTATAAATA AAGAACAAAA AACCAAAGAA AAAACATCTG 
CAAAATATTG AAAAACAAGA GCCTGAAAAA CAGAAACAAA 
ACGGTATCAA TTCAAACGGT AGAAATAAGG GAATCAAATC 
AAGTACTACA AGCAAGCTTA TCCGATTCAA ACATTCACTC 
GAAAAGGAAT TTCTAAAACC AGAAGA'TAAA ATCTTGCCCA 
TTGAGCATCT TAATAAATAA AAAATTGTTA GACTTTAAAG 
TCAACTTTAA AAAATTTCAA AGAAATTAAA AATATTGAGA 
TTATTATTTG TCTTAACCCT TAAAGATAAA AATAACAAGA 
AATCCCCCAA ACGACATCCA AAAACCCAAA GATTATATTT 
ATTAAAAAGG GTACTGGTGA GAAATACTTA AATCCTATCT 
AAAAAAGATT ATCATTCAAT AGATTACAAC AAAGTGACTA 
TTGGAC C T AC TGCCTCACGA ACAAGTCTTT CAAATGAATA 
GACACAATAA CAGACTTAAA TAATCTAAAA TTAGTAATTC 

t4-15 -nt 

TTGTGATTTATCTATAAATAAAGAACAAAAAACCAAAGAAAAAACATCTGAAAAGCAAGAATCTGAAAAACAAAAT 
ATTGAAAAACAAGAGCCTGAAAAACAGAAACAAAATGCAGCAAAAATAATCCCTACGGTATCAATTCAAACGGTAG 
AAATAAGGGAATCAAATCAAATTCCAJiAAAGCATTGAGAAGTACTACAAGCAAGCTTATCCGATTCAAACATTCAC 
TCTTGATTTTAGCATCACAAGAGAAAAGGAATTTCTAAAACCAGAAGATAAAATCTTGCCCACACAGGGGAAAGTG 



TAGATAAACA AAATCTGCTA 
ATAATGGATT TTTAATGAAT 
TATACGATCA ACGTGATATG 
AAATTATTGG CAATTATGAT 
TAGTAAATGT TTTCAAAATT 
AAAC T TT AAA AGTTACTTTG 
ATTTCTTATC AAAGTATAAT 



TGATCTTGTC 
AAAAGCAAGA 
ATGCAGCAAA 
AAATT C C AAA 
TTGATTTTAG 
CACAGGGGAA 
CCCCAGAAAA 
ATTTCTTCCA 
AC ACT ATT AA 
TAAAAGACCT 
ATAGATTTCA 
TTAGC GAAAA 
AAAATTTCAC 
AAAAAGAATT 



TTGTGATTTA 
ATCTGAAAAA 
AATAATCCCT 
AAGCATTGAG 
CATCACAAGA 
AGTGGAGTCT 
TCCAAAAAGC 
AAATCAAGAC 
CATCATGCTC 
TAAAGACACA 
AATAAAAAAC 
AACAATAGAA 
TAAAATTTTA 
AGTGTAA 
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GAGTCTTTGAGCATCTTAATAAATAAAAAATTGTTAGACTTTAAAGCCCCAGAAAATCCAAAAAGCTCAACTTTAA 
AAAA.TTTCAAAGAAATTAAAAATATTGAGAATTTCTTCCAAAATCAA.GACTTATTATTTGTCTTAACCCTTAAA.GA 
TAAAAATAACAACAACACTATTAACATCATGCTCAATCCCCCAAACGACATCCAAAAACCCAAAGATTATATTTTA 
AAAGACCTTAAAGACACAATTAAAAAGGGTACTGGTGAGAAATACTTAAATCCTATCTATAGATTTCAAATAAAAA 
ACAAAAAAGATTATCATTCAATAGATTACAACAAAGTGACTATTAGCGAAAAAACAATAGAATTGGACCTACTGCC 
TCACGAACAAGTCTTTCAAATGAATAAAAATTTCACTAAA 

f4-15.aa 

MSKKVILILL EILILSCDLS INKEQKTKEK TSEKQESEKQ NIEKQEPEKQ KQNAAKIIPT 
VSIQTVEIRE SNQIPKSIEK YYKQAYPIQT FTLDFSITRE KEFLKPEDKI LPTQGKVESL 
SILINKKLLD FKAPENPKSS TLKNFKEIKN IENFFQNQDL LFVLTLKBKN NNNTINIMLN 
PPNDIQKPKD YILKDLKDTI KKGTGEKYLN PIYRFQIKNK KDYHSIDYNK VTISEKTIEL 
DLLPHEQVFQ MNKNFTKILD TITDLNNLKL VIQKELV 

t4-15 -aa 

CDLSINKEQKTKEKTSEKQESEKQNIEKQEPEKQKQNAAKIIPTVSIQTVEIRESNQIPKSIEKYYKQAYPIQTFT 
LDFSITREKEFLKPEDKILPTQGKVESLSILINKKLLDFKAPENPKSSTLKNFKEIKNIENFFQNQDLLFVLTLKD 
KNNISTNTINIMLN^ 
HEQVF QMNKNFTK 

f 4-50 .nt 

TAGAAGGAGG AAAAAATGAA AATTGGAAAG CTAAATTCAA TAGTTATAGC CTTGTTTTTT 
AAACTATTGG TCGCATGTAG TATTGGATTA GTAGAAAGAA CAAATGCAGC TCTTGAATCG 
TCCTCTAAGG ATTTAAAAAA CAAAATTTTA AAAATAAAAA AAGAAGCCAC GGGAAAAGGT 
GTACTTTTTG AAGCTTTTAC AGGTCTTAAA ACCGGTTCCA AGGTAACAAG TGGTGGACTA 
GCCTTAAGAG AAGCAAAAGT ACAAGCCATT GTTGAAACAG GAAAGTTCCT TAAGATAATA 
GAAGAAGAAG CTTTAAAGCT TAAAGAAACT GGAAACAGTG GTCAATTCTT GGCTATGTTT 
GACTTAATGC TTGAGGTTGT AGAATCGCTA GAAGACGTTG GAATAATAGG CTTAAAAGCC 
CGTGTTTTAG AGGAATCTAA AAATAATCCT ATAAACACAG CTGAAAGATT GCTTGCGGCT 
AAAGC TC AAA TAGAAAATCA ACTTAAAGTG GTTAAGGAAA AACAAAATAT TGAAAATGGT 
GGAGAGAAAA AAAATAATAA AAGCAAAAAA AAGAAATAA 

t4-50 .nt 

ATGTAGTATTGGATTAGTAGAAAGAACAAATGCAGCTCTTGAATCGTCCTCTAAGGATTTAAAAAACAAAATTTTA 
AAAATAAAAAAAGAAGCCACGGGAAAAGGTGTACTTTTTGAAGCTTTTACAGGTCTTAAAACCGGTTCCAAGGTAA 
CAAGTGGTGGACTAGCCTTAAGAGAAGCAAAAGTACAAGCCATTGTTGAAACAGGAAAGTTCCTTAAGATAATAGA 
AGAAGAAGCTTTAAAGCTTAAAGAAACTGGAAACAGTGGTCAATTCTTGGC.TATGTTTGACTTAATGCTTGAGGTT 
GTAGAATCGCTAGAAGACGTTGGAATAATAGGCTTAAAAGCCCGTGTTTTAGAGGAATCTAAAAATAATCCTATAA 
ACACAGCTGAAAGATTGCTTGCGGCTAAAGCTCAAATAGAAAATCAACTTAAAGTGGTTAAGGAAAAACAAAATAT 
TGAAAATGGTGGAGAGAAAAAAAATAATAAAAGCAAAAAAAAGAAA 

f4-50.aa 

KEEKMKIGKL NSIVIALFFK LLVACSIGLV ERTNAALESS SKDLKNKILK IKKEATGKGV 

LFEAFTGLKT GSKVTSGGLA LREAKVQAIV ETGKFLKIIE EEALKLKETG NSGQFLAMFD 

LMLEWESLE DVG 1 1 GLKAR VL2ESKNNPI NTASRLLAAK AQIENQLKW KEKQNIENGG 

EKKNNKSKKK K 



t4-50 .aa 
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TABLE 1 . Nucleotide and Amino Acid Sequences 



CSIG1THP.TNAALESSSKDLKNKILKIKKEATGKGVLFEAFTGLKTGSKVTSGGLALREAKVQAIVETGKFLKIIS 

EEAXXI.XETGNSGQFLAMFDLMLEWESLEDVGIIGLKA^ 

EKGG ZK2C WK S KKKK 



TAAT-.TTTAA AATTTAAATA 
ATTTTACTTAT TTGTTATTTT 
AGGAGAAGGC CTTCAAAGGT 
ATTTCTTTTG TAGATTCTTT 
GTAGTTCTTG ATAGCAATCC 
GTAAAAGAAA TGCTTGTTAC 
GCTCTTC-AGA GCTCAAATGT 
GAAAAAACTT ACATAGATAA 
GCTTGTAATC TTGTAAAAAA 



TTTACATAAT AGTAATGTGT 
ATTATTCTTT TCTTGTAAAG 
TTTAAATGCT TCTAATGGTG 
AAATGATGAT CAAAAAGAAG 
CGACAAGTTT AATCAAATTT 
TGTTGTTAAG TGTTTAAAGG 
TGCAAATGTT GCCAATGCTA 
TTTGCGACAA TCTTTTATGA 
TTATGATGCA TCTGCTTCGT 



GTGGGAGACG TATGAAAAAT 
AATTTAATTA TTCTGATCTT 
CAT C AAAT AA AGAACTTAAA 
CTTTGTTTTT TCTTGAACAG 
TTAATTTAAA TGAAGAGAAG 
CCAAAAGAAA GGCTAAAATG 
AACAGCAATT GCTACAGGTT 
CTACTAAAAA CATTGAAGAG 
TTTAA 



TTGTAAAGAATTTAATTATTCTGATCTTAGGAGAAGGCCTTCAAAGGTTTTAAATGCTTCTAATGGTGCATCAAAT 
AAAGAACTTAAAATTTCTTTTGTAGATTCTTTAAATGATGATCAAAAAGAAGCTTTGTTTTTTCTTGAACAGGTAG 
TTCT^GATAGCAATCCCGACAAGTTTAATCAAATTTTTAATTTAAATGAAGAGAAGGTAAAAGAAATGCTTGTTAC 
TGTTGTTAAGTGTTTAAAGGCCAAAAGAAAGGCTAAAATGGCTCTTGAGAGCTCAAATGTTGCAAATGTTGCCAAT 
GCTAAACAGCAATTGCTACAGGTTGAAAAAACTTACATAGATAATTTGCGACAATCTTTTATGACTACTAAAAACA 
TTOAAGAGGCTTGTAATCTTGTAAAAAATTATGATGCATCTGCTTCGTTT 

£4-SS.aa 

FLKFKYLHNS NVCGRRMKNI LLFVILLFFS CKEFNYSDLR RRPSKVLNAS NGASNKELKI 

SFVD5LNDDQ KEALFFLEQV VLDSNPDKFN QIFNLNEEKV KEMLVTWKC L KAKRKAKMA 

LE3SW.WA NAKQQLLQVE KTYIDNLRQS FMTTKNIEEA CNLVKNYDAS ASF 



CKEFZjF/SDLRRRPSKVLNASNGASNKELKISFVDSLNDDQKEALFFLEQWLDSNPDKFNQIFNLNESKVKEMLVT 
VVTKCLXAXRKAKMALESSOTAWANAK^^ 



TAATTATTAA aatctaagga gaagagattt atgaacaaaa aattttctat ttcattatta 

TCTACAATAT TAGCCTTCTT GTTAGTATTA GGTTGTGATT TGTCAAGCAA TAATGCTGAA 
AACAAAATGG ATGATATTTT TAATTTAGAA AAGAAATACA TGGATAATTC AAATTATAAA 
TGTTTAAGTA AAAATGAGGC TATAGTTAAA AATTCTAAAA TTAAATTAGG TGTAAATAAT 
ACTAGAAGTC GTTCTTATTC TTCTAGAGAG ACTAATGTTT CGGATTCCTA TAATAAAACC 
TATTCATATT GCAAAAGCAA CTGA 

t42-l.ru: 

TTGTGATTTGTCAAGCAATAATGCTGAAAACAAAATGGATGATATTTTTAATTTAGAAAAGAAATACATGGATAA.T 
TCAAATTATAAATGTTTAAGTAAAAATGAGGCTATAGTTAAAAATTCTAAAATTAAATTAGGTGTAAATAATACTA 
GAAGTCGTTCTTATTCTTCTAGAGAGACTAATGTTTCGGATTCCTATAATAAAACCTATTCATATTGCAAAAGCAA 
C 

f 42-1 .aa 



LLKSK2KRFM NKKFSISLLS TILAFLLVLG CDLSSNNAEN KMDDIFNLEK KYMDNSNYKC 
L5KNEAIVKN SKIKLGVNNT RSRSYSSRET NVSDSYNKTY SYCKSN 



TABLE 1 . Nucleotide and Amino Acid Sequences 
t42-l .aa 

CDLSSNNAENKMDDIFNLEKKYMDN^ 
f43-3 .nt 



TGAATATTAA TAATAAAAAA AGGAATAANA ATGAAAATTA TCAACATATT ATTTTGTTTA 

TTTTTACTAA TGCTAAACAG CTGTAATTCT AATGATACTA AT ACTAGC C A AACAAAAAGT 

AGACAAAAAC GTGATTTAAC CCAAAAAGAA GCAACACAAG AAAAACCAAA ATC TAAAGAA 

GACCTGCTTA GAGAAAAGCT ATCTGAAGAC CAAAAAACAC ATCTTGACTG GTTAAAAACC 

GCTTTAACTG GTGCTGGAGA ATTTGATAAA TTTTTAGGAT ATGAC GAAGA CAAAATAAAA 

GGTGCACTTA ATC AT AT AAA GAGTGAACTT GATAAGTGTA CTGGGGATAA TTCTGAACAA 

CAAAAAAGCA CCTTCAAAGA GGTGGTTAAG GGGGCTCTTG GTGGCGGTAT AGATAGTTTT 

GCAACTAGTG CAAGTAGTAC CTGCCAAGCT CAGCAATAA 

t43-3.nt 



CTGTAATTCTAATGATACTAATACTAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGCAACA 
CAAGAAAAACCAAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTA'TCTGAAGACCAAAAAACACATCTTGACTGGT 
TAAAAACCGCTTTAACTGGTGCTGGAGAATTTGATAAATTTTTAGGATATGACGAAGACAAAATAAAAGGTGCACT 
TAATCATATAAAGAGTGAACTTGATAAGTGTACTGGGGATAATTCTGAACAACAAAAAAGCACCTTCAAAGAGGTG 
GTTAAGGGGGC TCTTGGTGGCGGTAT AGATAGTTTTGCAACTAGTGC AAGTAGT AC C TGC CAAGCTCAGCAA 

f43-3 -aa 



ILIIKKGIXM KIINILFCLF LLMLKSCNSN DTNTSQTKSR QKRDLTQKEA TQEKPKSKED 
LLREKLSEDQ KTHLDWLKTA LTGAGEFDKF LGYDEDKIKG ALNHIKSELD KCTGDNSEQQ 
KSTFKEWKG ALGGGIDSFA TSASSTCQAQ Q 



t43-3 .aa 



CNSNDTNTSQTKSRQKRDLTQKEATQEKPKSKEDLLREKL.SEDQKTHLDWLKTALTGAGEFDKFLGYDEDKIKGAL 
NHIKSELDKCTGDNSEQQKSTFKEWKGALGGGIDSFATSASSTCQAQQ 

f45-2.nt 



TAGGAGAGAA TAATTATGAA TAAAAAAACA 
ATTTCTTGCA AGAATTTTGC AAC TGGT AAA 
AAAGGATTTG TAAATAAGAT TTTAGATCCA 
AAAGTAGATG AAGTAGCAAA AAAATTACAA 
GATGATCCTA ATGGCAGTGG AATAAATCCG 
AATGCATTAG TATTAAAAGC AATAGAACAA 
GAAGCTGAAG CTAAAGTTGA AGAAAATAAA 
AAAGAAAAAG AAATAATAGA C GAAC AAAAC 
GAACAACAAA AAGAACAAAA AAGACATCAA 
AAAGAAAAAA GAGAAAGAGA AGAGGCAGAA 
AAAAGGCAAG TTGATAACCA AATTAAAACA 
AATATTGATG TTATAAAATG GCAAACGACT 
ACTGGGCCTG TGTATGATGA TTTTACCAAT 
GGGTTAGAAG AGGAATCAGA AGACGAAGGA 
GCTAGGGACG CGCTAAGAAC TAAATTAAAT 
GAGCCTAAGT . TAAAAGAAAG TGTAAATGTT 
AAATCAAAAT TAGAAGAAGT TAAAAAATAT 
AAAGGATACA T C AGTGAC AG TCAGTAA 



TTGATTATTT GTGCTGTTTT TGCGCTGATA 
GATATAAAAC AAAATTCAGA AGGGAAAATT 
GTAAAGGATA AAATTGCTTC AAGTGGTACA 
GAAGAAGAAA AAGAAGAATT AATGCAGGGC 
CCACCAGTAT TGCCGGAAAA TATTCACAAT 
AGTGATGGTC AACAAGAAAA AAAAGTAGAA 
GAAAAACAAG AGAATACAGA AGAAAACATT 
AAACAAGAAT TAGCTAAAGC TAAAGAAGAA 
GAAGAGCAAC AAAGAAAAGC TAAAGCAGAA 
CAACAAAAAC GACAACAAGA AGAGGAAGAA 
CTTATAGCTA AAATAGATGA GATCAATGAA 
GTAGGC C C AC AAGGC GTT AT AGATAGAATT 
GGCAATAATT CTATACGCGA AAC TTGGGAG 
TTAGGAAAAT TATTGAAAGA ATTGAGTGAT 
GAAGGCAATA AACCATATAC TGGTTACGAA 
AGCGAAATTA AAGAAGATTT AGAAAAATTA 
CTTAAAGATA GTTCTAAATT TGAAGAAATT 



t45-2 .n; 




TABLE 1. Nucleotide and Amino Acid Sequences 



TTGCAAGAATTTTGCAACTGGTAAAGATATAAAACAAAATTCAGAAGGGAAAATTAAAGGATTTGTAAATAAGATT 
TTAGATCCAGTAAAGGATAAAATTGCTTCAAGTGGTACAAAAGTAGATGAAGTAGCAAAAAAATTACAAGAAGAAG 
AAAAAGAAGAATTAATGCAGGGCGATGATCCTAATGGCAGTGGAATAAATCCGCCACCAGTATTGCCGGAAAATAT 
TCACAATAATGCATTAGTATTAAAAGCAATAGAACAAAGTGATGGTCAACAAGAAAAAAAAGTAGAAGAAGCTGAA 
GCTAAAGTTGAAGAAAATAAAGAAAAACAAGAGAATACAGAAGAAAACATTAAAGAAAAAGAAATAATAGACGAAC 
AAAACAAACAAGAATTAGCTAAAGCTAAAGAAGAAGAACAACAAAAAGAACAAAAAAGACATCAAGAAGAGCAACA 
AAGAAAAGCTAAAGCAGAAAAAGAAAAAAGAGAAAGAGAAGAGGCAGAACAACAAAAACGACAACAAGAAGAGGAA 
GAAAAAAGGCAAGTTGATAACCAAATTAAAACACTTATAGCTAAAATAGATGAGATCAATGAAAATATTGATGTTA 
TAAAATGGCAAACGACTGTAGGCCCAGAAGGCGTTATAGATAGAATTACTGGGCCTGTGTATGATGATTTTACCAA 
TGGCAATAATTCTATACGCGAAACTTGGGAGGGGTTAGAAGAGGAATCAGAAGACGAAGGATTAGGAAAATTATTG 
AAAGAATTGAGTGATGCTAGGGACGCGCTAAGAACTAAATTAAATGAAGGCAATAAACCATATACTGGTTACGAAG 
AGCCTAAGTTAAAAGAAAGTGTAAATGTTAGCGAAATTAAAGAAGATTTAGAAAAATTAAAATCAAAATTAGAAGA 
AGTTAAAAAATATCTTAAAGATAGTTCTAAATTTGAAGAAATTAAAGGATACATCAGTGACAGTCAG 

f45-2 .aa 

ERI I MNKKTL IICAVFALII SCKNFATGKD IKQNSEGKIK GFVNKILDPV KDKIASSGTK 

VDEVAKKLQE EEKEELMQGD DPNGSGINPP PVLPENIHNN ALVLKAI EQS DGQQEKKVEE 

AEAKVEENKE KQENTEENIK EKEIIDEQNK QELAKAKEEE QQKEQKRHQE EQQRKAKAEK 

EKREREEAEQ QKRQQEEEEK RQVDNQIKTL IAKIDEINEN IDVIKWQTTV GPQGVIDRIT 

GPVYDDFTNG NNSIRETWEG LEEESEDEGL GKLLKELSDA RDALRTKLNE GNKPYTGYEE 

PKLKESVNVS EIKEDLEKLK SKLEEVKKYL KDSSKFEEIK GYISDSQ 

t45-2.aa 

CKNFATGKDIKQNSEGKIKGFVNKILDPVKDKIASSGTKVDEVAKKLQEEEKEELMQGDDPNGSGINPPPVLPENI 
HNNALVLKAIEQSDGQQEKKVEEAEAKVEENKEKQENTEENIKEKEIIDEQNKQELAKAKEEEQQKEQKRHQEEQQ 
RKAKAEKEKREREEAEQQKRQQEEEEKRQVDNQIKTLIAKIDEINENIDVIKWQTTVGPQGVIDRITGPVYDDFTN 
GNNSIRETWEGLEEESEDEGLGKLLKELSDARDALRTKLNEGNKPYTGYEEPKLKESVNVSEIKEDLEKLKSKLEE 
VKKYLKDSSKFEEIKGYISDSQ 

f47-2.nt 

TGAATATTAA TAATAAAAAA AGGAGTAACA ATGAAAATCA TCAACATATT ATTTTGTATA 
TCTTTGCTAC TACTAAATAG CTGTAATTCC AATGATAATG ACACTTTAAA AAACAATGCC 
CAACAAACAA AAAGCAGGAA AAAAC GTG AT TTAAGCCAAG AAGAACTGCC ' ACAACAAGAA 
AAAATCACTT TAACATCCGA CGAAGAAAAA ATGTTTACTT CATTAATCAA TGTGTTTAAA 
TACACAATTG AAAAATTAAA CAATGAAATA CAAGGGTGCA TGAATGGAAA CAAAAGTAAA 
TGTAATGACT TCTTTGATTG GCTTTCTGAA GATATTCAAA AACAAAAAGA ATTAGCTGGT 
GCTTTTACCA AGGTTTACAA CTTCTTAAAA TCAAAAGCAC AAAATGAAAC TTTTGATACT 
TATATTAAAG GAGCTATTGA TTGTAAAAAA AACACTCCAC AAGATTGTAA TAAAAATAAT 
GAAATATGGG GAGGTGGACA ACTTANTAGN GCAATATTTT AG 

t47-2.nt 

CTGTAATTCCAATGATAATGACACTTTAAAAAACAATGCCCAACAAACAAAAAGCAGGAAAAAACGTGATTTAAGC 
CAAGAAGAACTGCCACAACAAGAAAAAATCACTTTAACATCCGACGAAGAAAAAATGTTTACTTCATTAATCAATG 
TGTTTAAATACACAATTGAAAAATTAAACAATGAAATACAAGGGTGCATGAATGGAAACAAAAGTAAATGTAATGA 
CTTCTTTGATTGGCTTTCTGAAGATATT 

CAAAAACAAAAAGAATTAGCTGGTGCTTTTACCAAGGTTTACAACTTCTTAAAATCAAAAGCACAAAATGAAACTT 
TTGATACTTATATTAAAGGAGCTATTGATTGTAAAAAAAACACTCCACAAGATTGTAATAAAAATAATGAA 

f47-2 .aa 



ILIIKKGVTM KIINILFCIS . LLLLNSCNSN DNDTLKNNAQ QTKSRKKRDL SQEELPQQEK 



TABLE 1. Nucleotide and Amino Acid Sequences 

ITLTSDEEKM FTSLINVFKY TIEKLNNEIQ GCMNGNKSKC NDFFDWLSED IQKQKELAGA 
FTKVYNFLKS KAQNETFDTY IKGAIDCKKN TPQDCNKNNE IWGGGQLXXA IF 



CNSNDNDTLKKNAQQTKSRKKRDLSQEELPQQEKITLTSDEEKMFTSLIWFKYTIEKLMSTEIQGCMNGNKSKCND 
FFDWLSEDIQKQKELAGAFTKVYNFLKSKAQNETFDTYIKGAIDCKKNTPQDCNKNNE 



TAAATGTTCA AAACAATCAT TAAACAAAAA AATATGAAAA AAATTTCAAG TGCAATTTTA 
TTAACAACTT TCTTTGTTTT TATTAATTGT AAAAGCCAAG TTGCTGATAA GGCGAGTGTG 
ACGGGGATTG CTAAGGGAAT AAAGGAGATT GTTGAAGCTG CTGGGGGGAG TGAAAAGCTG 
AAAGTTGCTG CTGCTGAAGG GGAGAATAAT GAAAAGGCAG GGAAGTTGTT TGGGAAGGCT 
GGTGCTGGTA ATGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGCGCAGGAT 
GGAGAGAAGC CTGGGGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGAG 
GATGGTGCGG AGTTTAAGGA TGAGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGC TAAGG A TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGGCGAGTTG TTGGATAAGC TGGTAAAAGC TGTAAAGACA 
GCTGAGGGGG CTTCAAGTGG TACTGCTGCA ATTGGAGAAG TTGTGGCTGA TGATAATGCT 
GCGAAGGTTG CTGATAAGGC GAGTGTGAAG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGGGAGTAA AAAGCTGAAA GTTGCTGCTG CTAAAGAGGG CAATGAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGCT GCTCATGCTG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGTTAGT GGGGAGCAGA TATTAAGTGC GATTGTTAAG 
GCTGCTGGTG CGGCTGCTGG TGATCAGGAG GGAAAGAAGC CTGGGGATGC TAAAAATCCG 
ATTGCTGCTG CTATTGGGAA GGGTGATGCG GAGAATGGTG CGGAGTTTAA TCATGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAGTGG TGGTGGTGAG AAAGGGAAGG CTGAGGGGGC TAT T AAGGG A 
GCTGCTGAGT TGTTGGATAA GCTGGTAAAA GCTGTAAAGA CAGCTGAGGG GGCTTCAAGT 
GGTACTGATG CAATTGGAGA AGTTGTGGCT AATGCTGGTG C TGC AAAGGT TGC TGATAAG 
GCGAGTGTGA CGGGGATTGC TAAGGGGATA AAGGAGATTG TTGAAGCTGC TGGGGGGAGT 
GAAAAGCTGA AAGTTGCTGC TGCTACAGGG GAGAGTAATA AAGGGGCAGG GAAGTTGTTT 
GGGAAGGCTG GTGCTGGTGC TAATGCTGGG GACAGTGAGG CTGCTAGCAA GGCGGCTGGT 
GCTGTTAGTG CTGTTAGTGG GGAGCAGATA TTAAGTGCGA TTGTTAAGGC TGCTGATGCG 
GCTGATCAGG AGGGAAAGAA GCCTGGGGAT GCTANAAATC CGATTGCTGC TGCTATTGGG 
AAGGGTNATG NGGAGAATGG TGCGGAGTTT AANNATGANG GATGA 



TTGTAAAAGCCAAGTTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGAATAAAGGAGATTGTTGAAGCTGCT 
GGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTGAAGGGGAGAATAATGAAAAGGCAGGGAAGTTGTTTGGGAAGG 
CTGGTGCTGGTAATGCTGGGGACAGTGAGGCTGCTAGCAAGGCGGCTGGTGCTGTTAGTGCTGTTAGTGGGGAGCA 
GATATTAAGTGCGATTGTTAAGGCTGCTGGTGAGGCTGCGCAGGATGGAGAGAAGCCTGGGGAGGCTAAAAATCCG 
ATTGCTGCTGCTATTGGGAAGGGTAATGAGGATGGTGCGGAGTTTAAGGATGAGATGAAGAAGGATGATCAGATTG 
CTGCTGCTATTGCTTTGAGGGGGATGGCTAAGGATGGAAAGTTTGCTGTGAAGAATGATGAGAAAGGGAAGGCTGA 
GGGGGCTATTAAG 

f49-2.aa 

MFKTIIKQKN MKKIS3AILL TTFFVFINCK SQVADKASVT GIAKGIKEIV SAAGGSEKLK 

VAAAEGENNE KAGKLFGKAG AGNAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAAQDG 

EKPGEAKNPI AAAIGKGNED GAEFKDEMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAS 

GAIKGAGELL DKLVKAVKTA EGASSGTAAI GEWADDNAA KVADKASVKG IAKGIKSIVE 

AAGGSKKLKV AAAKEGNEKA GKLFGKVDAA HAGDSEAASK AAGAVSAVSG EQILSAIVKA 

AG AAAGDQ EG KKPGDAXNPI AAAIGKGDAE NGASFNHDGM KXDDQIAAAI ALRGMAKDGK 
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FAVKSGGGEK GKAEGAIKGA AELLDKLVKA VKTAEGASSG TDAIGEWAN AGAAKVADKA 
SVTGIAKGIK EIVEAAGGSE KLKVAAATGE SNKGAGKLFG KAGAGANAGD SEAASKAAGA 
VSAVSGEQIL SAIVKAADAA DQEGKKPGDA XNPIAAAIGK GXXENGAEFX XXG 



CKSQVADKASVTGIAKGIKEIVEAAGGSEKLKVAAAEGENNEKAGKLFGKAGAGNAGDSEAASKAAGAVSAVSGEQ 
ILSAIVKAAGEAAQDGEKPGEAKNPIAAAIGKGNEDGAEFKDEMKKDDQIAAAIALRGMAKDGKFAVKNDEKGKAE 
GAIK 

f5-14.nt 

TAGAAATTCA AAACAAAGGA C-AAAACAAAA AGTATGAATA AAAAAATATT GATTATTTTT 

GCTGTTTTTG CACTTATAAT TTCTTGTAAA AATTATGCAA CTGGTAAAGA TATAAAACAA 

AATGCAAAAG GGAAAATTAA AGGATTTTTA GATAAGGTTT TAGATCCAGC AAAAGATAAA 
ATTACTTCAA GTAGTTCAAA AGTAGATGAA TTAGCAAAAA AATTACAAGA AGAAGATGAA 

GATAATGAAT TAATGCAGGG CGATGATCCT AATAACAGAG CAATAGCACT GTTACCAGTA 

TTGCCGGAAA ATAGTCATGA CAATCCACCA GT AC C AAAAG TAAAAGCAGC AGCACAAAGT 

GGTGGTCAAC AAGAAGACCA AAAAGCAAAA GAATCTAAAG ATAAAGTTGA GGAAGAAAAA 

GAAGTTGTAG AGGAGAAAAA AGAAGAACAA GATAGTAAAA AAGAAAAAGT GGAGAAGCAA 

AGTCAAAAGC AAAAAGAAGA AGAGAGAAAC TCTAAAGAAG AACAACAAAA ACAAGAAGAA 

GCAAAAGCTA GAGCAGATAG AGAAAGAGAA GAACGACTAA AACAACAAGA ACAAAAAAGA 

CAACAGGAAG AAGCTAGGGT TAAAGCAGAA AAAGAAAAAC AAGAAAGAGA GGAACAACAA 

AAACAAGAAG AAGAAAAGAA AGTTAAATAT AAAATTAAAA CACTTACAGA CAAAATAGAT 

GAAATAAATA AGGATATTGA TGGTATAAAT GGTAAAACAA TTGTAGGAGC AGAAGAAGTT 

ATAGATAAAA TTACGGGGCC TGT AT AT GAT GATTTTACTG ATGGGAATAA AGCTATATAC 

AAAACTTGGG GAGATTTAGA GGATGAAGAA GGCGAAGAAT TAGGAAAATT ATTGAAAGAA 

TTGAGTGATA CTAGACATAA TTTAAGAACC AAATTAAATG AGGGTAATAA AGCATATATT 

GTTCTAGAAA AGGAGCCTAA TTTAAAAGAA AATGTAAATG TTAGTGATAT TCAATCAGAT 

TTAGAAAAAT TAAAATCAGG ATTAGAAGAA GTTAAAAAAT ATTTTGAAAA TGAAGATAAT 
TTTGAAGAAA T T AAAGG AT A CATTGAGGAT AGTAA'TTCAT ATTGA 

t5-I4.nt 

TTGTAAAAATTATGCAACTGGTAAAGATATAAAACAAAATGCAAAAGGGAAAATTAAAGGATTTTTAGATAAGGTT 
TTAGATCCAGCAAAAGATAAAATTACTTCAAGTAGTTCAAAAGTAGATGAATTAGCAAAAAAATTACAAGAAGAAG 
ATGAAGATAATGAATTAATGCAGGGCGATGATCCTAATAACAGAGCAATAGCACTGTTACCAGTATTGCCGGAAAA 
TAGTCATGAC AATC CAC CAGTAC C AAAAGTAAAAGC AGC AGC AC AAAGTGGTGGTCAACAAGAAGACC AAAAAGC A 
'AAAGAATCTAAAGATAAAGTTGAGGAAGAAAAAGAAGTTGTAGAGGAGAAAAAAGAAGAACAAGATAGTAAAAAAG 
AAAAAGTGGAGAAGCAAAGTCAAAAGCAAAAAGAAGAAGAGAGAAACTCTAAAGAAGAACAACAAAAACAAGAAGA 
AGCAAAAGCTAGAGCAGATAGAGAAAGAGAAGAACGACTAAAACAACAAGAACAAAAAAGACAACAGGAAGAAGCT 
AGGGTTAAAGCAGAAAAAGAAAAACAAGAAAGAGAGGAACAACAAAAACAAGAAGAAGAAAAGAAAGTTAAATATA 
AAATTAAAACACTTACAGACAAAATAGATGAAATAAATAAGGATATTGATGGTATAAATGGTAAAACAATTGTAGG 
AGCAGAAGAAGTTATAGATAAAATTACGGGGCCTGTATATGATGATTTTACTGATGGGAATAAAGCTATATACAAA 
ACTTGGGGAGATTTAGAGGATGAAGAAGGCGAAGAATTAGGAAAATTATTGAAAGAATTGAGTGATACTAGACATA 
ATTTAA.GAACCAAATTAJ\ATGAGGGTAATAAAGCATATATTGTTCTAGAAAAGGAGCCTAATTTAAAAGAAAATGT 
AAATGTTAGTGATATTCAATCAGATTTAGAAAAATTAAAATCAGGATTAGAAGAAGTTAAA_AAATATTTTGAAAAT 
GAAGATAATTTTGAAGAAATTAAAGGATACATTGAGGATAGTAATTCATAT 

£5-14.aa 

KFKTKEKTKS MNKKILIIFA VFALIISCKN YATGKDIXQN AKGKIKGFLD KVLDPAKDKI 

TSSSSKVDEL AKKLQEEDED NELMQGDDPN NRAIALLPVL PENSHDNPPV PKVKAAAQSG 

GQQEDQKAKS SKDKVEEEKE WEEKKEEQD SKKEKVEKQS QXQKEEERNS KEEQQKQEEA 

KARADREREE RLKQQEQKRQ QEEARVKAEK EKQEREEQQK QEEEKKVKYK IKTLTDKIDE 

INKDIDGING KTIVGAEEVI DKITGPVYDD FTDGNKAIYK TWGDLEDEEG EELGKLLKSL 
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SDTRHNLRTK LNEGNKAYIV LEKEPNLKEN VTWSDIQSDL EKLKSGLEEV KKYFENEDNF 
EEIKGYIEDS NSY 

t5-14 .aa 

CKNYATGKDIKQNAKGKIKGFLDKVLDPAKDKITSSSSKVDELAKKLQEEDEDNELMQGDDPNNRAIALLPVLPEN 
SHDNPPVPKVKAAAQSGGQQEDQKAKESKDKVEEEKEWEEKKEEQDSKKEKVEKQSQKQKEEERNSKEEQQKQEE 
AKARADREREERLKQQEQKRQQEEARVKAEKEKQEREEQQKQEEEKKVKYKIKTLTDKIDEINKDIDGINGKTIVG 
AEEVIDKITGPVYDDFTDGNKAIYKTWGDLEDEEGEELGKLLKELSDTRHNLRTKLNEGNKAYIVLEKEPNLKENV 
NVSDIQSDLEKLKSGLEEVKKYFENEDNFEEIKGYIEDSNSY 

f5-15 -nt 

TAACTTATGA ATAAGAAAAT GAAAATGTTT ATTATTTGTG CTGTTTTTGC ATTGATGATT 
TCTTGCAAGA ATTATGCAAG TGGTGAAAAT CTAAAAAATT CAGAACAAAA TCTAGAAAGT 
TCAGAACAAA ATGTAAAAAA AACAGAACAA GAGATAAAAA AACAAGTTGA AGGATTTTTA 
GAAATTCTAG AGACAAAAGA TTTATCTAAA TTAGATGAAA AAG AT AC AAA AGAAATTC-AA 
AAACAAATTC AAGAATTAAA GAATAAAATA GAAAAATTAG ATTCTAAAAA AACTTCTATT 
GAAACATATT CTGAGTATGA AGAAAAAATA AACAAAATAA AAGAAAAATT GAAAGGAAAA 
GGACTTGAAG ATAAATTTAA GGAGC T TGAA GAGAGTTTAG CAAAGAAAAA GGGGGAGAGA 
AAAAAAGCTT TACAAGAGGC CAAACAGAAA TTTGAAGAAT ATAAAAAACA AGTAGATACT 
TCAACTGGGA AAACTCAAGG CGACAGGTCT AAAAACCGAG GTGGTGTTGG AGTGCAAGCT 
TGGCAGTGTG CCAATGAATT AGGTTTGGGT GTAAGTTATT CTAATGGCGG CAGTGACAAC 
AGCAATACTG ATGAATTAGC AAACAAAGTT ATAGATGATT CTCTTAAAAA GATTGAAGAA 
GAAC TTAAGG GAATAGAAGA AGATAAAAAA GAATAA 

t5-15 -nt 

TTGCAAGAATTATGCAAGTGGTGAAAATCTAAAAAATTCAGAACAAAATCTAGAAAGTTCAGAACAAAATGTAAAA 
AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTCTAGAGACAAAAGATTTATCTAAATTAG 
ATGAAAAAGATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAATAAAATAGAAAAATTAGATTCTAAAAA 
AACTTCTATTGAAACATATTCTGAGTATGAAGAAAAAATAAACAAAATAAAAGAAAAATTGAAAGGAAAAGGACTT 
GAAGATAAATTTAAGGAGCTTGAAGAGAGTTTAGCAAAGAAAAAGGGGGAGAGAAAAAAAGCTTTACAAGAGGCCA 
AACAGAAATTTGAAGAATATAAAAAACAAGTAGATACTTCAACTGGGAAAACTCAAGGCGACAGGTCTAAAAACGG 
AGGTGGTGTTGGAGTGCAAGCTTGGCAGTGTGCCAATGAATTAGGTTTGGGTGTAAGTTATTCTAATGGCGGCAGT 
GACAACAGCAATACTGATGAATTAGCAAACAAAGTTATAGATGATTCTCTTAAAAAGATTGAAGAAGAACTTAAGG 
GAAT AGAAGAAG AT AAAAAAG AA 

f5-15 .aa 

LMNKKMKMFI ICAVFALMIS CKNYASGENL KNSEQNLESS EQNVKKTEQE IKKQVEGFLE 
ILETKDLSKL DEKDTKEIEK QIQELKNKIE KLDSKKTSIE TYSEYEEKIN KIKEKLKGKG 
LEDKFKELEE SLAKKKGERK KALQEAKQKF EEYKKQVDTS TGKTQGDRSK NRGGVGVQAW 
QCANELGLGV SYSNGGSDNS NTDELANKVI DDSLKKIEEE LKGIEEDKKE 

t5-15.aa 

CKNYASGENLKNSEQNLESSEQNVKKTEQEIKKQVEGFLEILETKDLSKLDEKDTKEIEKQIQELKNKIEKLDSKK 
TSIETYSEYEEKINKIKEKLKGKGLEDKFKELEESLAKKKGERKKALQEAKQKFEEYKKQVDTSTGKTQGDRSKNR 
GGVGVQAWQCANELGLGVSYSNGGSDNSNTDELANKVIDDSLKKIEEELKGIEEDKKE 

f51-2.nt 

TAATTGTTTG GGGTTGTGGT AAACTTAAGG CTTATGGAGT GGATTATGAA TAAAAAAATG 
AAAATATTTA TTATTTGTGC TGTATTTGTG CTGATAAGTT CTTGCAAGAT TGATGCAACT 
GGTAAAGATG CAACTGGTAA AGATGCAACT GGTAAAGATG CAACTGGTAA AGATGCAACT 
GGTAAAAATG CAGAACAAAA TATAAAAGGG AAAGTTCAAG GATTTTTAGA AAAGATTTTA 
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GATCCAGTAA AGGATAAAAT TGCTTCAAAT GGTCCAATAG CAGATGAATT GGCAAAAAAA 
TTACAAGAAG AAGAAAAGGT AAATAACGGG GAAGAAGAAA ATGATAAAGC TGTCTTTTTA 
GGAGAAGAAT CAAAAGAGGA TGAAGAAGAA AATGAGCAAG CTGTTAATTT AGAAGAAAAA 
AATGCGGAAG AGGATAAGAA AGTTGTTAAT TTAGAAGAGA AAGAATTAGA AGTTAAAAAA 
GAGACTGAAG AAGATGAAGA TAAAGAAGAA ATAGAGAAAC AAAAACAAGA AGTGGAAAAA 
GCACAAGAAA GAAAACAACG ACAAGAAGAA AAGAAACGAA AAAAACAAGA ACAGCAAGAA 
GAAAAGAAAC GAAAACGACA AGAACAAAGA AAAGAAAGGA GAGCTAAAAA CAAAATTAAA 
AAACTTGCGG ATAAAATAGA TGAGATAAGT TGGAATATTG ATGGTATAGA AAGT CAAAC A 
AGTGTAAAAC CGAAAGCAGT TATAGATAAA ATTACGGGGC CTGTATATGA TTATTTTACC 
GATGACAACA AAAAAGCTAT ATATAAAACA TGGGGAGATT TAGAAGATGA AGAAGGCGAA 
GGATTGGGAA AATTATTGAA AGAATTGAGT GATACTAGAG ATGAGTTAAG AACCAAATTA 
AATAAAGATA ATAAAAAATA TTATGCCCAT GAAAATGAGC CTCCTCTAAA AGAAAATGTA 
GATGTCAGCG AAATTAAAGA AGATTTAGAA AAAGTAAAAT CAGGATTAGA AAAGGTTAAA 
GAATATCTTA AAGACAATTC TAAATTTGAA GAAATTAAAG GATACATCAG TTACAGTCAG 
TAA 

tSl-2 .nt 

TTGCAAGATTGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCA 
ACTGGTAAAAATGCAGAACAAAATATAAAAGGGAAAGTTCAAGGATTTTTAGAAAAGATTTTAGATCCAGTAAAGG 
ATAAAATTGCTTCAAATGGTCCAATAGCAGATGAATTGGCAAAAAAATTACAAGAAGAAGAAAAGGTAAATAACGG 
GGAAGAAGAAAATGATAAAGCTGTCTTTTTAGGAGAAGAATCAAAAGAGGATGAAGAAGAAAATGAGCAAGCTGTT 
■ AATTTAGAAGAAAAAAATGCGGAAGAGGATAAGAAAGTTGTTAATTTAGAAGAGAAAGAATTAGAAGTTAAAAAAG 
'AGACTGAAGAAGATGAAGATAAAGAAGAAATAGAGAAACAAAAACAAGAAGTGGAAAAAGCACAAGAAAGAAAACA 
ACGACAAGAAGAAAAGAAACGAAAAAAACAAGAACAGCAAGAAGAAAAGAAACGAAAACGACAAGAACAAAGAAAA 
GAAAGGAGAGCTAAAAACAAAATTAAAAAACTTGCGGATAAAATAGATGAGATAAGTTGGAATATTGATGGTATAG 
AAAGTCAAACAAGTGTAAAACCGAAAGCAGTTATAGATAAAATTACGGGGCCTGTATATGATTATTTTACCGATGA 
CAACAAAAAAGCTATATATAAAACATGGGGAGATTTAGAAGATGAAGAAGGCGAAGGATTGGGAAAATTATTGAAA 
GAATTGAGTGATACTAGAGATGAGTTAAGAACCAAATTAAATAAAGATAATAAAAAATATTATGCCCATGAAAATG 
AGCCTCCTCTAAAAGAAAATGTAGATGTCAGCGAAATTAAAGAAGATTTAGAAAAAGTAAAATCAGGATTAGAAAA 
GGTTAAAGAATATCTTAAAGACAATTCTAAATTTGAAGAAATTAAAGGATACATCAGTTACAGTCAG 

f51-2 .aa 

LFGWVNLRL MEWIMNKKMK IFIICAVFVL ISSCKIDATG KDATGKDATG KDATGKDATG 
KNAEQNIKGK VQGFLEKILD PVKDKIASNG PIADELAKKL QEEEKWINGE EENDKAVFLG 
EESKEDEEEN EQAVNLEEKN AEEDKKWNL EEKELEVKKE TEEDEDKEEI EKQKQEVEKA 
QERKQRQEEK KRKKQEQQEE KKRKRQEQRK ERRAKNKIKK LADKIDEISW NIDGIESQTS 
VKPKAVIDKI TGPVYDYFTD DNKKAI YKTW GDLEDEEGEG LGKLLKELSD TRDELRTKLN 
KDNKKYYAHE NEPPLKENVD VSEIKEDLEK VKSGLEKVKE YLKDNSKFEE IKGYISYSQ 

t51-2.aa 

CKIDATGKDATGKDATGKDATGKDATGKNAEQNIKGKVQGFLEKILDPVKDKIASNGPIADELAKKLQEEEKA/NWG 
EEENDKAVFLGEESKEDEEENEQAVNLEEKNAEEDKKVVNLEEKEL.EVKKETEEDEDKEEIEKQKQEVEKAQERKQ 
RQEEKKRKKQEQQEEKKRKRQEQRKERRAKNKIKKLADKIDEISWNIDGIESQTSVKPKAVIDKITGPVYDYFTDD 
NKKAIYKTWGDLEDEEGEGLGKLLKEL3DTRDELRTKLNKDNKKYYAHENEPPLKENVDVSEIKEDLEKVKSGLEK 
VKEYLKDNSKFEEIKGYISYSQ 

f6-21.nt 

TAGGCAAAAT TTAAATTTAT AAAAAC TTGT AAGGATGCTT GTATGAAAAT ATTGATAAAA 
AAGTTAAAAG TTGTATTATT TCTCAATTTA ATTTTACTTA TTTCTTGTGT TAATGAAAGT 
AATAGAAACA AATTGGTTTT TAAGCTAAAT ATTGGAAGTG AGCCTGCTAC TTTA.GATGCT 
CAATTAATAA AC GATACGGT TGGATCAGGG ATTGTAAGCC AAATGTTTCT TGGCATTTTA 
GATGGAGATC CCAGGACTGG AGGATACAGA CCGGGACTTG CTAAAAGTTG GGATATTTCT 



205 

TABLE 1 . Nucleotide and Amino Acid Sequences 



GATGACGGAG TAGTTTATAC GTTTCATTTA AGAGATAATC TTGTTTGGAG TGATGGAGTT 
TCCATTACTG CCGAAGAATA A 

t6-21 .nt 

TTGTGTTAATGAAAGTAATAGAAACAAATTGGTTTTTAAGCTAAATATTGGAAGTGAGCCTGCTACTTTAGATGCT 
CAATTAATAAACGATACGGTTGGATCAGGGATTGTAAGCCAAATGTTTCTTGGCATTTTAGATGGAGATCCCAGGA 
CTGGAGGATACAGACCGGGACTTGCTAAAAGTTGGGATATTTCTGATGACGGAGTAGTTTATACGTTTCATTTAAG 
AGATAATCTTGTTTGGAGTGATGGAGTTTCCATTACTGCCGAAGAA 

f6-21.aa 

AKFKFIKTCK DACMKILIKK LKWLFLNLI LLISCVNESN RNKLVFKLNI GSEPATLDAQ 
LINDTVGSGI VSQMFLGILD GDPRTGGYRP GLAKSWDISD DGWYTFHLR DNLVWSDGVS 
ITAEE 

t6-21 .aa 

CVNESNRNKLVFKLNIGSEPATLDAQLINDTVGSGIVSQMFLGILDGDPRTGGYRPGLAKSWDISDDGWYTFHLR 
DNLVWSDGVSITAEE 

f 6-27 .nt 

TAAAGAAAAG CTTGCATAAA AAGTATAACA AATTCTTTAA TAATTAAAAT CAAAAAGAAT 
ATAATTATTG C AC T AAAATT AAATTTATAC AGTTATATAG AATCACTTAA GGAACAAAAA 
ATGAAATACC TTAAAAACAT TTCCTTATTT TTGTTAATTT TAGGTTGCAA ATCCATCCCA 
AATGGTAATT TCAATCTACA C GATAC AAAC CATAAATTAG GAAAACTAAA ATTTCAAGAA 
GACTCGATAA TAAGCAGAAA TTATGATAAT AAAATATCCA TTGTGGGAGT AT AC AAC C C T 
TTAACAGAAA AAGAAAATTT TAAAGTCAAT ATTTTCATCA AAAAAAAAGG ATTACAAATA 
GATCCTGAAA ATATTTTGAT AAATGAAGAA AAAATTAATT ATTCAAAATA TAAAGCAGAA 
CTCAAAGTAA AATCTAGCTT TAATAAAAGC ATTATCAGTA TTTCACTAAC TAATTCAAGA 
GATCTATTAA CCTACATTTA CGATAAAAGC ACAGGGAAAT ACATTAACAT TGACTTTAAG 
GACAATTGGA AC GT ATCGCA CAGTATAAAA TTTAATAAGG AGTATATTTT AGCATATATA 
ACAGATTTTG ATAAAGAAAT TAAAATATCT AAAAATATTT TGCAAAAACG TATTGATAAT 
AGAAAAATTG AAATTGAAAA AACAGAGCTT AAAACAGAAT ATAATGAAAT AGAGGATTAT 
TACATCTACA GTATGAAAAT TCCAAAATTA TTTGAAAAAT CAGACGCTCC - CTCTGAAACT 
T AC GAAAC AT TTGTTATAGC AAATTATTAC CCCTGTGAAA ATTTAAATAT ACTGTTTTTG 
AATTTAAGCT TATACTCTGA TAAATTACGC TTTCTAAACT CTATTTATGA TGAGAATGAT 
AGAAAATTAA AAATGGAGCC TCCTGTGAGA GCCTTAAAGA ATTCAAAAAC AATAAAAGAA 
ACATTAAATA TAGTATTAAG TCCTCAAAAA ATAATAGAGC TAGCAAAAAA CATTGAAAAA 
GATATTACTC T AAAATT AAA ATC TTACGGA GAAAAGGGAG AATTCACATT TGAAATATAT 
AAACCACTTC TTTTAAAATT CTTAAAAGAA GTAGATCATT GCATAAAAAA TTTGCAATCA 
AGTAGGCATA AATTTTAA 

t6-27 .nt 

TTGCAAATCCATCCCAAATGGTAATTTCAATCTACACGATACAAACCATAAATTAGGAAAACTAAAATTTCAAGAA 
GACTCGATAATAAGCAGAAATTATGATAATAAAATATCCATTGTGGGAGTATACAACCCTTTAACAGAAAAAGAAA 
ATTTTAAAGTCAATATTTTCATCAAAAAAAAAGGATTACAAATAGATCCTGAAAATATTTTGATAAATGAAGAAAA 
AATTAATTATTCAAAATATAAAGCAGAACTCAAAGTAAAATCTAGCTTTAATAAAAGCATTATCAGTATTTCACTA 
ACTAATTCAAGAGATCTATTAACCTACATTTACGATAAAAGCACAGGGAAATACATTAACATTGACTTTAAGGACA 
ATTGGAACGTATCGCACAGTATAAAATTTAATAAGGAGTATATTTTAGCATATATAACAGATTTTGATAAAGAAAT 
TAAAATATCTAAAAATATTTTGCAAAAACGTATTGATAATAGAAAAATTGAAATTGAAAAAACAGAGCTTAAAACA 
GAATATAATGAAATAGAGGATTATTACATCTACAGTATGAAAATTCCAAAATTATTTGAAAAATCAGACGCTCCCT 
CTGAAACTTACGAAACATTTGTTATAGCAAATTATTACCCCTGTGAAAATTTAAATATACTGTTTTTGAATTTAAG 
CTTATACTCTGATAAATTACGCTTTCTAAACTCTATTTATGATGAGAATGATAGAAAATTAAAAATGGAGCCTCCT 
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GTGAGAGCCTTAAAGAATTCAAAAACAATAAAAGAAACATTAAATATAGTATTAAGTCCTCAAAAAA.TAATAGAGC 
TAGCAAAAAACATTGAAAAAGATATTACTCTAAAATTAAAATCTTACGGAGAAAAGGGAGAATTCACATTTGAAAT 
ATATAAACCACTTCTTTTAAAATTCTTAAAAGAAGTAGATCATTGCATAAAAAATTTGCAATCAAGTAGGCATAAA 

f 6-27 .aa 

RKACIKSITN SLIIKIKKNI IIALKLNLYS YIESLKEQKM KYLKNISLFL LILGCKSIPN 

GNFNLHDTNH KLGKLKFQED SIISRNYDNK ISIVGVYNPL TEKENFKVNI FIKKKGLQID 

PENILINEEK INYSKYKAEL KVKSSFNKSI ISISLTNSRD LLTYIYDKST GKYINIDFKD 

NWNVSHSIKF NKEYILAYIT DFDKEIKISK NILQKRIDNR KIEIEKTELK TEYNEIEDYY 

IYSMKIPKLF EKSDAPSETY ETFVIANYYP CENLNILFLN LSLYSDKLRF LNS I YDENDR 

KLKMEPPVRA LKNSKTIKET LNIVLSPQKI IELAKNIEKD ITLKLKSYGE KGEFTFEIYK 
PLLLKFLKEV DHCIKNLQSS RHKF 

t6-27 .aa 

CKSIPNGNFNLHDTNHKLGKLKFQEDSIISPJNTYDNKISIVGVYNPLTEKENFKVNIFIKKKGLQIDPENILINEEK 
IKTYSKYKAELKVKSSFNKSIISISLTNSRDLLTYIYDKSTGKYINIDFKDNWNVSHSIKFNKEYILAYITDFDKEI 
KISKNILQKRIDNRKIEIEKTELKTEYNEIEDYYIYSMKIPKLFEKSDAPSETYETFVIANYYPCENLNILFLNLS 
LYSDKLRFLNSIYDENDRKLKMEPPVRALKNSKTIKETLNIVLSPQKIIELAKNIEKDITLKLKSYGEKGEFTFEI 
YKPLLLKFLKEVDHCIKNLQSSRHKF 

f6-5 .nt 

TAAATGAAGA AGTTTTTAAT ATCCGTTTAT TTTTTATTGT TTTATGGTTG TTCAACTATA 
TCTTTGGTAA AAATACCAGA AAAAGATAAA ATAAATTTAA CTGTTTTATC ATCTTTAATG 

AATTATCCTG ATTTGAAGAT TTCAAATTTT AAAATAAAAG ACTACGAACA TTTGCATTAT 

TCATCTGATT TTGAAAGCTT GAGTGATACT AAAAATAGTG CTTATATTTA CGTTGATGAA 

TCTAGTTTCA ATAATAATAT TAATTTTATT AAAGATCTTT TTATTTATAA TAAGAAATTA 

TATAGAATAC TTATTGCTTA TAGCTTGACC C AAGGTGC AT CTTTTAAGGC AGAAGTTTTA 

TCTTATCTTG AAAAACAAAA AATTATGAAA AATTTTTCAT TGAAAATAAA TTTTCCAACT 

GCTAAAAAAT TTATGGATAA TAAGTATTGG ATTGTAATTG CAAAAAACCA TTTAGATTCT 

CTTGTTAAGA GTAAAAATTA TTTAGTCTTG GCGAATGTAA AGATGGAATA TATACTCAAA 
AAGTTT.TTAA CTTGA 

t6-5 .nt 

TTGTTCAACTATATCTTTGGTAAAAATACCAGAAAAAGATAAAATAAATTTAACTGTTTTATCATCTTTAATGAAT 
TATCCTGATTTGAAGATTTCAAATTTTAAAATAAAAGACTACGAACATTTGCA'TTATTCATCTGATTTTGAAAGCT 
TGAGTGATACTAAAAATAGTGCTTATATTTACGTTGATGAATCTAGTTTCAATAATAATATTAATTTTATTAAAGA 
TCTTTTTATTTATAATAAGAAATTATATAGAATACTTATTGCTTATAGCTTGACCCAAGGTGCATCTTTTAAGGCA 
GAAGTTTTATCTTATCTTGAAAAACAAAAAATTATGAAAAATTTTTCATTGAAAATAAATTTTCCAACTGCTAAAA 
AATTTATGGATAATAAGTATTGGATTGTAATTGCAAAAAACCATTTAGATTCTCTTGTTAAGAGTAAAAAT 

f6-5.aa 

MKKFLISVYF LLFYGCSTIS LVKIPSKDKI NLTVLSSLMN YPDLKISNFK IKDYEHLHYS 

SDFESLSDTK NSAYIYVDES SFISTNNINFIK DLFIYNKKLY RILIAYSLTQ GAS FKAEVLS 

YLEKQKIMKN FSLKINFPTA KKFMDNKYWI VIAKNHLDSL VK3XNYLVT2A NVKMEYILKK 



CSTISLVKIPEKDKINLT^/LSSLM^^YPDLKISNFKIKDYEHLHYSSDFSSLSDTKNSAYIYVDESSFNNNINFIKD 
LFIYNKXLYRILIAYSLTQGASFKAEVLSYLEKQKIMKNFSLKINFPTAKKFMDNKYWIVIAKNHLDSLVKSKN 
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f7-30.nt 

TAGAGACGAA GTCACAAGCA AAATGTTAAA AGATTTACAA AATCAAGTTC AAGGGGGCAA 
ATAATGAAAA ATTTAAAGAC AAAAATTAAT TTTTTAGGGA TATTTTGGCT ACTGTTACTA 
TTTCTTTCTT GCGAATCAAT ACCATCACTT CCCCAAAAAC CAACCCTAAC AAACAAAGAA 
GATATTGAAA ATTTAATGCT CGATGAAGCA GAACTTTTTA GATACTCAAC CGCACTAAAT 
GTTTGGCTTT TGACTGTAAA ATCTTATGTG ATCAAATACT ATCCTAATGA CAAATTTCCT 
GTGTTTGAAA ATTTTGATCC CGTGTTTGGC GATGAAAATG GAACTAAAGA AACAAATATA 
CTAAAAAATC GAATTACCTA CTACAATCGA TACATAGAAA AAACCGAACC GATTGTATTT 
GGGTGTTACA AAAAATACAG CAGAAGATAA 

t7-30.nt 

TTGCGAATCAATACCATCACTTCCCCAAAAACCAACCCTAACAAACAAAGAAGATATTGAAAATTTAATGCTCGAT 
GAAGCAGAACTTTTTAGATACTCAACCGCACTAAATGTTTGGCTTTTGACTGTAAAATCTTATGTGATCAAATACT 
ATCCTAATGACAAATTTCCTGTGTTTGAAAATTTTGATCCCGTGTTTGGCGATGAAAATGGAACTAAAGAAACAAA 
TATACTAAAAAATCGAATTACCTACTACAATCGATACATAGAAAAAACCGAACCGATTGTATTTGGGTGTTACAAA 
AAATACAGCAGAAGA 

f7-30 .aa 

RRSHKQNVKR FTKSSSRGQI MKNLKTKINF LGIFWLLLLF LSCESIPSLP QKPTLTNKED 
IENLMLDEAE LFRYSTALNV WLLTVKSYVI KYYPNDKFPV FENFDPVFGD ENGTKETNIL 
KNRITYYNRY IEKTEPIVFG CYKKYSRR 

t7-30 .aa 

CESIPSLPQKPTLTNKEDIENLMLDEAELFRYSTALNVWLLTVKSYVIKYYPNDKFPVFENFDPVFGDENGTKETN 
ILKNRITYYNRYIEKTEPIVFGCYKKYSRR 

f76-l.nt 

TGAATATTAA TAATAAAAAA AGGAGTAACA 
TTTTTACTAA TGCTAAACGG CTGTAATTCT 
AGACAAAAGC GTGATTTAAC CCAAAAAGAA 
AAAGAAGACC TGC TT AG AG A AAAGCTATCT 
AAAACCGCTT TAACTGGTGT TGGAAAATTT 
ATTAAATCAG CACTTGAACA TATAAAGACT 
GGAAAAAACA CCTTCAAAAC TACCGTTCAA 
TTTGCAGATC AAGCAACTGC TACCTGCAAT 

t7S-l .nt 

CTGTAATTCTAATGATACAAATACCAAGCAGACAAAAAGCAGACAAAAGCGTGATTTAACCCAAAAAGAAGCAACA 
CAAGAAAAACCTAAATCTAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTATCTGATC-ATCAAAAAACACAACTTG 
ACTGGTTAAAAACCGCTTTAACTGGTGTTGGAAAATTTGATAAATTCTTAGAAAATGATGAAGGCAAAATTAAATC 
AGCAC TTGAAC ATATAAAGAC TGAACTTGAT AAATGTAATGGAAATGATGAAGGAAAAAAC AC CTTCAAAACTAC C 
GTTCAAGGGTTTTTTAGCGGCGGCAATATAGATAATTTTGCAGATCAAGCAACTGCTACCTGCAAT 

f76-l .aa 

ILIIKKGVTM KIINILFCLF LLMLNGCNSN DTNTKQTKSR QKRDLTQKEA TQEKPKSKSK 
EDLLREKLSD DQKTQLDWLK TALTGVGKFD KFLENDEGKI KSALEHIKTE LDKCNGNDEG 
KNTFKTT VQG FFSGGNIDNF ADQATATCN 



ATGAAAATTA TCAACATATT ATTTTGTTTG 
AATGATACAA ATACCAAGCA GACAAAAAGC 
GCAACACAAG AAAAACCTAA ATCTAAATCT 
GATGATCAAA AAACACAACT TGACTGGTTA 
GATAAATTCT TAGAAAATGA TGAAGGCAAA 
GAAC TTGAT A AATGTAATGG AAATGATGAA 
GGGTTTTTTA GC GGCGGC AA TATAGATAAT 
TAA 



t75-l.aa 
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CNSNDTNTKQTKSRQKRDLTQKEATQEKPKSKSKEDLLREKLSDDQKTQLDWLKTALTGVGKFDKFLENDEGKIKS 
ALEHIKTELDKCNGNDEGKNTFKTTVQGFFSGGNIDNFADQATATCN 

f 8-10 .nt 



TAAGTAAGGA GAATATTTAT GAAATATAAT 
TTAACTGCTT GCAATCCAGA TTTTAACACA 
ATTTCAAATC AAGATGCAGA TTCTGATAAA 
ATAAATTTAA TAGAAAAAGC GAATGCAGAT 
GAACCTTCGG ATCAATATGG AATGTTGGCT 
CCACGGGAAT TAATATCTGA TACAGGTAGT 
TATAGTATTT TATTAAATGC TATTGAAACT 
ATACTGTCAA TAAAAGTACT AGAAATATTT 
GATGATGTGG TTGTTCACTT ATATTCCAAA 
AATTTAAAAA GACTTAAAAA TTTGTTTGAA 
AAGATGTCAA AACGTCTTTT ATTGGATTAT 
AACGCCAAGC TTGGATC TTA TGTGGTTGCA 
GAAGCAGAAA GGCTGAAAAG CGAGATAATT 

t8-10 .nc 



ACGATTATAA GCATATTTGT TTGTTTGTTT 
AATAAGAAAA GAACTCTAAG TAAGGGGATA 
ATAATAAAAA ATAAATTACT TGATGATTTA 
AGAGAAAAAT ATGTAAAAAA AATGGAAGAA 
GTTTTTGGAG GTATGTATTG GGCAGAATCA 
GAGAGATCTA TTAGGTATAG AAGGCGTGTT 
AATGAATTAA AGAAATTTTC AGAAATTAGA 
AGCCTATTTA ATCTATTTGG AAGTAC TCTT 
AAAGATACTC TAGGTAAACT AGATATTTCA 
AAATTATTAT CTATAAAAAC AATCGTTTCA 
CAAAATAATG AAAATTTTAT AAAAAC AG AT 
CTTTCCAATC AAATTCAAGA AAAATATAAT 
TTAATATATA CCCTTTAA 



TTGCAATCCAGATTTTAACACAAATAAGAAAAGAACTCTAAGTAAGGGGATAATTTCAAATCAAGATGCAGATTCT 
GATAAAATAATAAAAAATAAATTACTTGATGATTTAATAAATTTAATAGAAAAAGCGAATGCAGATAGAGAAAAAT 
ATGTAAAAAAAATGGAAGAAGAACCTTCGGATCAATATGGAATGTTGGCTGTTTTTGGAGGTATGTATTGGGCAGA 
ATCACCACGGGAATTAATATCTGATACAGGTAGTGAGAGATCTATTAGGTATAGAAGGCGTGTTTATAGTATTTTA 
TTAAATGCTATTGAAACTAATGAATTAAAGAAATTTTCAGAAATTAGAATACTGTCAATAAAAGTACTAGAAATAT 
TTAGCCTATTTAATCTATTTGGAAGTACTCTTGATGATGTGGTTGTTCACTTATATTCCAAAAAAGATACTCTAGG 
TAAACTAGATATTTCAAATTTAAAAAGACTTAAAAATTTGTTTGAAAAATTATTATCTAT AAAAAC AATCGTTTCA 
AAGATGTCAAAACGTCTTTTATTGGATTATCAAAATAATGAAAATTTTATAAAAACAGATAACGCCAAGCTTGGAT 
CTTATGTGGTTGCACTTTCCAATCAAATTCAAGAAAAATATAATGAAGCAGAAAGGCTGAAA 



f8-10 .aa 

VRRIFMKYNT 
NLIEKANADR 
SILLNAIETN 
LKRLKNLFEK 
AERLKSEIIL 



IISIFVCLFL 
EKYVKKMEEE 
ELKKFSEIRI 
LLSIKTIVSK 
IYTL 



TACNPDFNTN 
PSDQYGMLAV 
LSIKVLEIFS 
MSKRLLLDYQ 



KKRTLSKGII 
FGGMYWAESP 
LFNLFGSTLD 
NNENFIKTDN 



SNQDADSDKI 
RELISDTGSE 
DVWHLYSKK 
AKLGSYWAL 



IKNKLLDDLI 
RSI RYRRRVY 
DTLGKLDISN 
SNQIQEKYNE 



c8-10 .aa 



CNPDFNTNKKRTLS KG I I SNQDADSDKI IKNKLLDDL INL IEKANADREKYVKKMEEEPSDQYGMLAVFGGMYWAE 
SPRELISDTGSERSIRYRRRVYSILLNAIETNELKKFSEIRILSIKVLEIFSLFNLFGSTLDDVWHLYSKKDTLG 
KLDISNLKRLKNLFEKLLSIKTIVSKMSKRLLLDYQNNENFIKTDNAKLGSYWALSNQIQEKYNEAERLK 

f8-14.nt 



TAATATATAT TCTTGATTAA GGGAAAGGAG 
TATACATTGT TAACGATAGG ATTGATGTCT 
AAAGAGGAAC AAAAAAATAA CAATGATATA 
GCTATTAATA ATTTATATGG AAATAAAAAA 
AAATTGAAAG ACAAGGGTTT AGACGTGACC 
TCCGTAGAAT CTGCGGTGTC TTTAGGAGAA 
TCAATTGAGC ATAATCAAAA AAAAGAGATA 
GAAGAAAAGC AAGCGGATAA AGCAATTAAA 



AGTATTTTTA TGAAAAAAAA AATGTTTTTA 
TGTAATCTAA ATTCTAAATT ATCTGGTAAT 
AAAGAAGCTT TAAATGGCGT TCAAGAAAAT 
GAAAAAAAAG ATTTTATTAA AAATTCGGAA 
ACCCTCCCCT TAGAACC TGT AGTGGCGCCC 
TCTAATAATA GGATTGGTAT ACCAACCATT 
AAAGAAGAGG ATTTTTTCCC TTCTACTGA.G 
GATATAGAGA ATCTTATTGG AGAATCTGGA 



* 
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TTTCCCGAGT TAATTGAGAA TGTGTGCTCA CTTAAACATG AATATACTTT AATAAGAAGT 
GATTTTTATG ATGTGATAAC TAAGATTCAG AATAAAAAAA TATCACTAAT GAAAAATTCT 
CATAATAATA GAAATAAAAT AAGGGAACTA GTACAATTGC AAAATAATTT AAAGATAGGA 
GACGAACTTG ATAAAATTAT GGGTTGCATT GAT AC TGCAG AACAAGAGAT AAGATCTGCC 
GCTTTCTTTT TTGATGAAGC TAAGGAAAGC TTAAAAGAAG GTATTATTAA AAGATTGGAA 
AAAAGTAAAA ATAGGGCAGC ATCACAATTA TCTAAAAAGG CTTTAAATAG AGC AGAGGAT 
GCTTTAAGGT GCTTAGAAAA TTATTCTTCT AAAAAAGGTG AGGCAATAGG AAGAAGAAGC 
TTTATAAAAG AAGTTGTTGA ACAGGCAAAA AATGCTTTAA GTAAGTCTTA A 



TTGTAATCTAAATTCTAAATTATCTGGTAATAAAGAGGAACAAAAAAATAACAATGATATAAAAGAAGCTTTAAAT 
GGCGTTCAAGAAAATGCTATTAATAATTTATATGGAAATAAAAAAGAAAAAAAAGATTTTATTAAAAATTCGGAAA 
AATTGAAAGACAAGGGTTTAGACGTGACCACCCTCCCCTTAGAACCTGTAGTGGCGCCCTCCGTAGAATCTGCGGT 
GTCTTTAGGAGAATCTAATAATAGGATTGGTATACCAACCATTTCAATTGAGCATAATCAAAAAAAAGAGATAAAA 
GAAGAGGATTTTTTCCCTTCTACTGAGGAAGAAAAGCAAGCGGATAAAGCAATTAAAGATATAGAGAATCTTATTG 
GAGAATCTGGATTTCCCGAGTTAATTGAGAATGTGTGCTCACTTAAACATGAATATACTTTAATAAGAAGTGATTT 
TTATGATGTGATAACTAAGATTCAGAATAAAAAAATATCACTAATGAAAAATTCTCATAATAATAGAAATAAAATA 
AGGGAACTAGTACAATTGCAAAATAATTTAAAGATAGGAGACGAACTTGATAAAATTATGGGTTGCATTGATACTG 
CAGAACAAGAGATAAGATCTGCCGCTTTCTTTTTTGATGAAGCTAAGGAAAGCTTAAAAGAAGGTATTATTAAAAG 
ATTGGAAAAAAGTAAAAATAGGGCAGCATCACAATTATCTAAAAAGGCTTTAAATAGAGCAGAGGATGCTTTAAGG 
TGCTTAGAAAATTATTCTTCTAAAAAAGGTGAGGCAATAGGAAGAAGAAGCTTTATAAAAGAAGTTGTTGAACAGG 
CAAAAAATGCTTTAAGTAAGTCT 



YIFLIKGKES IFMKKKMFLY TLLTIGLMSC NLNSKLSGNK EEQKNNNDIX EALNGVQENA 

INNLYGNKKE KKDFIKNSEK LKDKGLDVTT LPLEPWAPS VSSAVSLGES NNRIGIPTIS 

IEHNQKKEIK EEDFFPSTEE EKQADKAI KD IENLIGESGF PELIENVCSL KHEYTLIRSD 

FYDVITKIQN KKISLMKNSH NNRNKIRELV QLQNNLKIGD ELDKIMGCID TAEQEIRSAA 

FFFDEAKESL KEGIIKRLEK SKNRAASQLS KKALNRAEDA LRCLENYSSK KGEAIGRRSF 
IKEWEQAKN ALSKS 



CNLNSKLSGNKEEQKNHNDIKEALNGVQENAI 

SLGESNNRIGIPTISIEHNQKKEIKEEDFFPSTEEEKQADKAIKDIENLIGESGFPELISNVCSLKHEYTLIRSDF 
YDVITKIQNKKISLMKNSHNNRNKIRELVQLQNNLKIGDELDKIMGCIDTAEQEIRSAAFFFDEAKESLKEGIIKR 
LEKSKWRAASQLSKKALNRAEDALRCLENYSSKKGEAIGRRSF IKEWEQAKN ALSKS 



TGATTAATTTTTTTTAAGGATTACGTTTTGAAAAGAAACAAAATTTGGAAAACGTTAAAACTGTTTCAAATAACTT 
TACTGTTCTCATGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATTAA 
ACTTGGAAAAATTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTTACAACAAAGC 
CAGTTCTTTAAAAATGAAAAAGAAAAAATAATTAAAAAAATTGCACAAGAATTTGATGAGAATGAAAAATTGATTA 
ATAAAATAGGTCCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCA 
ATTTGGAATAAATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATGTTAAAAGACAATCGACTT 
AGAAGATTATTTTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATACTCGCGCAAACAT 
CAAGCTCAAACGACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAC-CATTTGA 
AAGCGCTGTTAATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAGAG 
ATTCAGGAAAATTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATTATTGGCGAAT 
ATGACAAAAATACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGGATACGAGCATGA 
ACTCGACTCAAATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACAC 
TACTAA 



t8-14 .nt 



f8-14.aa 



t8-14 . aa 



f OlA.nc 



3B001 
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tOlA.nt 



BB001 



TGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATTAAACTTGGAAAAA 
TTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTTACAACAAAGCCAGTTCTTTAA 
AAATGAAAAAGAAAAAATAATTAAAAAAATTGCACAAGAATTTGATGAGAATGAAAAATTGATTAATAAAATAGGT 
CCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCAATTTGGAATAA 
ATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATGTTAAAAGACAATCGACTTAGAAGATTATT 
TTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATACTCGCGCAAACATCAAGCTCAAAC 
GACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAGCATTTGAAAGCGCTGTTA 
ATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAGAGATTCAGGAAAA 
TTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATTATTGGCGAATATGACAAAAAT 
ACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGGATACGAGCATGAACTCGACTCAA 
ATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACACTAC 

fOlA.aa BB001 

LIFFKDYVLKRNKIWKTLKLFQITLLFSCSFYSKSNNTEAISELQSSPIKLGKIKVLQKTEKIVSTQNLQNLQQSQ 
FFKNEKEKI I KKI AQEFDENEKLINKIGPNI EMFAQT INTDI QKIE PNDQFG INKTLFTEKKDNNIDFMLKDNRLR 
RLFYSSLNYDENKIKKLATILAQTSSSNDYHYTLIGLIFWTGFKIQEAFESAVNILTKDEQKRLIFNFRTKTVKEI 
QENFEKLMOEFJSSWIKIVDNIIGEYDKNTGGCKAQGKILGEVIRVGYEHELDSNKSMQILNNIETPLKTCCDHIHY 

tOlA.aa BB001 

CSFYSKSNNTEAISELQSSPIKLGKIKVLQKTEKIVSTQNLQNLQQSQFFKNEKEKIIKKIAQEFDENEKLINKIG 
PNIEMFAQTINTDIQKISPNDQFGINKTLFTEKKDNNIDFMLKDNRLRRLFYSSLNYDENKIKKLATILAQTSSSN 
DYHYTLIGLIFWTGFKIQEAFESAVNILTKDEQKRLIFNFRTKTVKEIQENFEKLMQERNSWIKIVDNIIGEYDKN 
TGGCKADGKILGEVIRVGYEHELDSNKSMQILNNIETPLKTCCDHIHY 

f02A.nt BB002 

TAATTAATACTGGTTTTAATTTATAAGGAGAGTATTTTGAAAAAAGCCAAACTAAATATAATCAAGATTAATATTA 
TTACAATGATATTAACTTTAATTTGCATCTCATGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACAC 
CAAGCTTAAAAAAAACACCAGACTGAAAAAACCCGCCAATCCAGGGGAAAACATCCAAAATTTTAAAGATAAATCT 
GGAGACCTTGGCGCTTCTGATGAAAAATTTATGGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAG 
AAGATCGAAAAAATCAATACGATATACAAATAGCCAAAATTACTAATGAAGAATCTAACCTATTAGATACTTATAT 
TCGGGCTTATGAACTAGCTAACGAAAATGAAAAAATGCTTTTAAAAAGATTTCTTCTTTCATCTTTAGATTATAAA 
AAAGAJVAACATAGAGACATTAAAAGAAATTCTTGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTG 
CAAATTTCCTTTATCGCATAGCGCTGGATATTCAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACT 
GGACACTCTAAGCAAAGAAAATTCAAAAGAAGATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGCTA 
CAAGAAAAGTTTAAAAAAACCCTAAACAAAACTCTTGAAGATTACCGTAAAAATACTAACAACATTCAAGAAAATA 
AAGTACTAGCAGAACACTTTAATAAATATTACAAAGACTCTGATTCTTTACAATCTGCCTTTTATTAA 

t02A.nt BB002 

TGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACACCAAGCTTAAAAAAAACACCAGACTGAAAAAAC 
CCGCCAATCCAGGGGAAAACATCCAAAATTTTAAAGATAAATCTGGAGACCTTGGCGCTTCTGATGAAAAATTTAT 
GGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAGAAGATCGAAAAAATCAATACGATATACAAATA 
GCCAAAATTACTAATGAAGAATCTAACCTATTAGATACTTATATTCGGGCTTATGAACTAGCTAACGAAAATGAAA 
AAATGCTTTTAAAAAGATTTCTTCTTTCATCTTTAGATTATAAAAAAGAAAACATAGAGACATTAAAAGAAATTCT 
TGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTGCAAATTTCCTTTATCGCATAGCGCTGGATATT 
CAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACTGGACACTCTAAGCAAJ\GAAAATTCAAAAGAAG 
ATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGCTACAAGAAAAGTTTAAAAAAACCCTAAACAAAAC 
TCTTGAAGATTACCGTAAAAATACTAACAACATTCAAGAAAATAAAGTACTAGCAGAACACTTTAATAAATATTAC 
AAAGACTCTGATTCTTTACAATCTGCCTTTTAT 



f02A.aa BB002 



211 

TABLE 1 . Nucleotide and Amino Acid Sequences 

' LILVLIYKESILKKAKLNIIKINIITMILTLICISCAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKDKSG 
DLGASDEKFMGTTASELKAIGKELEDRKNQYDIQIAKITNEESNLLDTYIRAYELANENEKMLLKRFLLSSLDYKX 
ENIETLKEILEKLINNYENDPKIAANFLYRIALDIQLKLEKHLKSINEKLDTLSKENSKEDLEALLEQVKSALQLQ 
EKFKKTLNKTLEDYRKNTNNIQENKVLAEKFNKYYKDSDSLQSAFY 

t02A.aa BB002 

CAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKDKSGDLGASDEKFMGTTASELKAIGKELEDRKNQYDIQI 
AKITNEESNLLDTYIRAYEI^JSrENEKMLLKRFLLSSLDYKKENIETLKEILEKLINKryENDPKIAANFLYRIALDI 
QLKLEKHLKSINEKLDTLSKENSKEDLEALLEQVKSALQLQEKFKKTLNKTLEDYRKNTlSraiQENKVLAEHFNKYY 
KDSDSLQSAFY 

f03A.nt BB006 

TGATTTAATGTAAATTTTAATTACCGCCTAAAAAAGGCTTTAAATGGTATAAAGGAAGAAGATCTAATGGTATTTA 
GAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGAAACCACAATC 
TGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACATTTAATATCAGGCAAAATTTCAAAT 
AAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAACAAAGGCAATGACAATCTTAGGCGAAG 
ATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAATATCTCCTGTAAAAATGGATGGAAA 
ATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGAAATTGAAGAT 
GTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCACAAGAAGAAG 
GATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAACATATAAAAA 
TGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAATTTATAAAATT 
TCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGAAAATTCTATATTAAAAGACATAGCTG 
GAGATTTATTTGAAGATATATAA 

t03A.nt BB006 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAAC 
AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 
ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 
GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 
AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 
AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTA 
CTCAAGAAACTAAAATTTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGA 
AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATA 

f03A.aa BB006 

FNVNFNYRLKKALNGIKEEDLWFRTYKHLELIMLPMLMLSCAFFKKPQSVHQDSNTGKPISDEKLHL^ 
KLPIINSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYIISPVKMDGKYSYYASLLILFETTKNGDDEYEIEDV 
KFVTAGSTLELKNSLLAVENSQEEGWTAYPFGILMSDEIKNAFKLTYKNGHWmfMLADLTVKNKLTQETKIYKIS 
LNSKL I I EFLKEVLKENS I LKD I AGDLFEDI 

t03A.'aa BB006 

CAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTILGEDGKEI-PEFKNKFGYSYI 
ISPVKMDGKYSYYASLLILFSTTKNGDDEYEIEDVKFVTAGSTLELKNSLLAVENSQEEGYVTAYPFGILMSDEIK 
NAFKLTYKNGHWNYMLADLTVKNKLTQETKIYKISLNSKLIIEFLKEVLKSNSILKDIAGDLFSDI 

f04A.nt BB011 

TAATTACCAAAGATAAGTAAACTTGCAAATAAAACTACACGTATTGAAAGTAGATTTGAAATTTCCATTATATTTA 
TATATAATGGCACTAAATATCTGAAAATGAAGGAGAAGCGGGTGGGCAATAAAATTTTTTATATTTCAGTGGTTTT 
AATTTTAATAGTTGGTTGCGACTGGGGAACTATTAAAGATAAAAGTACAGAAATTTCCAAGCTATTAAGAACGGAC 



212 

TABLE 1 . Nucleotide and Amino Acid Sequences 

AAAGAT AAG AC T AAAAAT CAAGAT AG AAT AG AATTGGGTGAAGAT AATTT TGT ATC T AAAAAT AAT ATGT C TAC T A 
CTGATACGGGCATTACTAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGGGTCAGTGAACC 
ACCTATAATCTCAAATGAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATA 
ATAAACCCAAAACCAGCTCAAAATTTGGGAAATTCTTTAAACAATACTACTACTGAAGATAGTGTGAAGTTTTTAT 
CAATTGAAAACCAAGAGTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAA 
AACACAACACGAAAAAGAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAA 
GAAATTATTAAGTTTAAGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTC 
AAAGGAATTCATTTATAAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTC 
ATC TATA.GAGAAAGAAATT AGAGATTTGAATTATAAGTTGNGTGAAATC C AAAGTAATTTT C AAATTGC AGATGTT 
AGCTGGAATAATGCAAACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACA 
ATGAGAGTAGAAAGCAAGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAA 
GGATGCAAAGTATAAGGCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCA 
AATGAAAAAGAAGC TAAAAAGCTATTAGAAGAAATT AAAAAAAGATTTGT ACGAATTGGT ATT AGC C TATAA 

t04A.nt BB011 

TGCGACTGGGGAACTATTAAAGATAAAAGTACAGAAATTTCCAAGCTATTAAGAACGGACAAAGATAAGACTAAAA 

ATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAAAATAATATGTCTACTACTGATACGGGCATTAC 

TAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGC-GTCAGTGAACCACCTATAATCTCAAAT 

GAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATAATAAACCCAAAACCAG 

CTCAAAATTTGGGAAATTCTTTAAACAATACTACTACTGAAGATAGTGTGAAGTTTTTATCAATTGAAAACCAAGA " 

GTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAAAACACAACACGAAAAA 

GAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAAGAAATTATTAAGTTTA 

AGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTCAAAGGAATTCATTTAT 

AAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTCATCTATAGAGAAAGAA 

ATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTTCAAATTGCAGATGTTAGCTGGAATAATGCAA 

ACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACAATGAGAGTAGAAAGCA 

AGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAAGGATGCAAAGTATAAG 

GCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCAAATGAAAAAGAAGCTA 

AAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTA.GCCTA 

f04A.aa BB011 

LPKISKLANKTTRIESRFEISIIFIYNGTKYLKMKEKRVGNKIFYISWLILIVGCDWGTIKDKSTEISKLLRTDK 
DKTKNQDRIELGEDNFVSKMWIST^^ 

NPKPAQNLGNSLNNTTTEDSVKFLSIENQEWLISKKILPSKLENLESFLKTQHEKEAFKTAKTIQSLISNSNMGKE 
IXKFKEEYYKLYNLFEGIQQKFHSQFJNTSFIKDTKFGENRQKNAVIFKSFSSIEKEIRDLNYKLXEIQSNFQIADVS 
WNNANSLLKESIEKLIQAIEKRYDNESRKQGQIGGPANRWDKNQADNFAKDAKYKAEHSANDLENAAiNrYFRYSCSN 
EKEAKKLLEEIKKRFVRIGISL 

t04A.aa BB011 

CDWGTIKDKSTEISKLLRTDKDKTKNQDRIELGEDNFVSKIJNMSTTDTGITSLGSLNNLDLINRSQRVSEPPIISN 
EKAIATQAKVDLMNWIWTIINPKPAQNLGNSLNWTTTEDSVKFLSIENQEWLISKKILPSKLENLESFLKTQHEK 
EAFKTAKTIQSLISNSNMGKEIIKFKEEYYKLYNLFSGIQQKFHSQRNSFIKDTKFGENRQKNAVIFKSFSSIEKS 
IRDLOTKLXEIQSNFQIADVSWSJNANSLLKESIEKLIQAIEKRYDNESRKQGQIGGPANRWDKNQADNFAKDAKYK 
AEHSANDLENAANYFRYSCSNEKEAKKLLEEIKKRFVRIGISL 

fOSA.nt BB009 

TAAATAAATTGTAGGATAAAAATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACCTATTAA 
TATTTTTACTACTAGCATGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATA 
CAATATTTTGGGCAGTTCAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTA 
TTTAAAAAAGAAAACGGCAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATA 
TATCTGGAAAAACCTATCTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTG 
GACATTAAAATTTAAAAAACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCGTAA 
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t05A.nt BB009 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAATATTTTGGGCAGTT 
CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTATTTAAAAAAGAAAACGG 
CAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATATCTGGAAAAACCTAT 
CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACATTAAAATTTAAAA 
AAC CGC TAAAAGC AT ATAAATTC TT AAAATCC GTAGAAGAGATGGCG 

f05A.aa BB009 

INCRIKMKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLF 
KKENGKIEKIDLSNSYSFINDIVNISGKTYLLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVEEMA 

t05A.aa BB009 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKSNGKIEKIDLSNSYEFINDIVNISGKTY 
LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVEEMA 

f06A.nt BB014 ' 

TAAGGAGCATATATGAGGATTTTGGTTGGCGTTTGTATAATAGCATTGGCTTTATTGGGTTGTTATTTGCCTGATA 
ATCAGGAACAAGCTGTTCAAACTTTTTTTGAGAATTCGGAAAGTAGTGATATGGGTTCCGATGAGATTGTTACTGA 
AGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTTATTGGTTGAGATAAAAAAGACTTTAATTAGT 
TTAAAAGATCCTAATTATCNNGNTGTAGTACNCCCAGTGAGTGACTATAATGAGGAGTATTTTAATAAATTCTTTC 
TAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTTTATTATGGTAAAAAATGAGCAGAACAATAATAA 
ATTTATGCGTATAGTTCGTTGGCTGTATTCATGTATAGAGGAGTTATATTCTCTAGATATTAAGTATTCTGGCGAG 
GGGAGCCATGAGTATAATCGTAATATGCCTAGACCCACTGCTTATGAACAATATTTAAAAGTGAAGAGGTATGATT 
ATAATAGCCCAGTTTCTATTTTACCTACATAA 

t0 6A.nt BB014 

TGTTATTTGCCTGATAATCAGGAACAAGCTGTTCAAACTTTTTTTGAGAATTCGGAAAGTAGTGATATGGGTTCCG 
ATGAGATTGTTACTGAAGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTTATTGGTTGAGATAAA 
AAAGACTTTAATTAGTTTAAAAGATCCTAATTATCNNGNTGTAGTACMCCCAGTGAGTGACTATAATGAGGAGTAT 
TTTAATAAATTCTTTCTAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTTTATTATGGTAAAAAATG 
AGCAGAACAATAATAAATTTATGCGTATAGTTCGTTGGCTGTATTCATGTATAGAGGAGTTATATTCTCTAGATAT 
TAAGTATTCTGGCGAGGGGAGCCATGAGTATAATCGTAATATGCCTAGACCCACTGCTTATGAACAATATTTAAAA 
GTGAAGAGGTATGATTATAAT 

f06A.aa BB014 

GAYMRILVGVCIIALALLGCYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLKLYASEHRLLVEIKKTLISL 
KDPNYXXVVXPVSDYNEEYFNKFFLDLGSEQSKDLIKLFIWKNEQNMJKFMRIVRWLYSCIEELYSLDIKYSGEG 
SHEYNRNMPRPTAYEQYLKVKRYDYNS PVS ILPT 

t06A.aa BB014 

CYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLKLYASEHRLLVEIKKTLISLKDPNYXXWi'PVSDYNEEY 

FNKFFLDLGSEQSKDLIKLFIjWKJSEQMSn^ 

VKRYDYN 

f07A.nt BB023 

TAAAGTATTTTATTTTTTTTATTATCCACTGTTCTTTTTGCTCAAGAGACTGATGGATTAGCAGAGGGTTCTAAAA 
GGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATCCAAGTTCAACTAGACTTGATCTVAC 
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AAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGTTGTAGATCTTGGGATA 
AATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCAGTTGTTGCGCCCGCTG 
TTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTCCAAGCTATTCTCA 
ATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCAATTTTTAGGCAAA 
GGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTATGAGATAGATCTTG 
AGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTTNNTCTATGGGTACTTTAAAGTTTAAAGGGTGGGCTGA 
TTTAATTTGGTCAAATCCTAACTATATTCCTAATATATCATCCAGAATTATTAAAGACGATGTTCCAAATTATCCT 
CTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTAGAGTTTCAAAGTCACACAGTTCAAAAGAGCAAAATTTCATCT 
TTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGTTTCAATAGATTCTGATATTGACAGTGAGTCTGT 
ATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAGGCACACGNAACNTTTAAAAGNGTTTTA 
AAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTCTTTCCAAAACTTTGTAGAAAAGATTGAGAGTGAAAAACCTG 
AAGAATCATCTCCGAAAAATTAG 

t07A.nt BB023 

GAGGGTTCTAAAAGGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATCCAAGTTCAACTA 
GACTTGATCTTACAAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGTTGT 
AGATCTTGGGATAAATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCAGTT 
GTTGCGCCCGCTGTTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTC 
CAAGCTATTCTCAATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCA 
ATTTTTAGGCAAAGGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTAT 
GAGATAGATCTTGAGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTTNNTCTATGGGTACTTTAAAGTTTA 
AAGGGTGGGCTGATTTAATTTGGTCAAATCCTAACTATATTCCTAATATATCATCCAGAATTATTAAAGACGATGT 
TCCAAATTATCCTCTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTAGAGTTTCAAAGTCACACAGTTCAAAAGAG 
CAAAATTTCATCTTTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGTTTCAATAGATTCTGATATTG 
ACAGTGAGTCTGTATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAGGCACACGNAACNTT 
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2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent c 
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TABLE 3. Conservative Amino Acid Substitutions. 



Hydrophobic 



Phenylalanine 

Tryptophan 

Tyrosine 

Leucine 

Isoleucine 

Valine 

Glutamine 
Asparagine 

Arginine 

Lysine 

Histidine 

Aspartic Acid 
Glutamic Acid 

Alanine 

Serine 

Threonine 

Methionine 

Glycine 
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TABLE 4. Residues Comprising Epito-Bearing Fragments 



Query 


Residues Comprising Epito-Bearing Fragments 


flOl.aa 


from about Lys-62 to about Gly-64, from about Ser-1 1 1 to about 
Asp-1 13, from about Arg-136 to about Arg-139, from about Pro- 
189 to about Asn-193. 


fll.aa 


from about Pro-38 to about Lys-40, from about Glu-92 to about 
Lys-96. 


fl2.aa 


from about Pro-288 to about Asp-290, from about Asn-336 to 
about Gly-338, from about Tyr-410 to about Gly-413, from about 

Asn-41 8 fo nhnnt Arof-d.90 frnm dKnnt D m +^ „l„„i \/„i ccc 
^j-<j w O.UK/UL .rug "ri.\j, injiii aoout rio-jjz to about val-jjj 

from 




about Gln-643 to about Asp-645, from about Gin- 1061 to about 
Arg-1063, from about Asn-1 130 to about Lys-1 132. 


fl29.aa 


from about Glu-76 to about Arg-81, from about Lys-144 to about 
Asn-146. 


fl47.aa 


from about Gln-94 to about Thr-96. 


fI52.aa 


from about Gfy-35 to about Gly-37, from about Gln-321 to about 
Gly-323. 


fl54.aa 


from about Asn-39 to about Lys-41, from about Ser-74 to about 

1 1 , Hum ciuuul oci-zij iq aoout vjiy-z 1 j , irom aoout oer-3U3 
to about Asp-306, from about Asp-422 to about Asn-424 


fl57.aa 


from about Lys-21 to about Asp-24, from about Ser-45 to about 
Tyr-47. 


flV.aa 


from about Arg-17 to about Asn-20, from about Tiir-94 to about 
Gly-96. 


fl86.aa 


from about Lys-305 to about Tyr-308 


fl96.aa 


from about T.v^-I^l to ahnnt Acn.lO'? fi-r-n-n oK/int Dt-^-l T7Q „u^,,+ 

iium auuui j->jro iu auuui AMri/j, irom aoout trTO-z /o to aoout 
Lys-282, from about Glu-576 to about Tyr-578. 


t899.aa 


from about Asn-174 to about Asp- 177. 


f925.aa 


from about Lys-201 to about Asp-204, from about Phe-291 to 


f929.aa 


from prion t "Pro-1 tr\ aKAnt A on 1 A i f^^r^ „ u , , ^ a 011 *. ~ t_ j. 

llum ^-uuui nu iJ7 lo dDout Asn-i^-i, irom aoout Arg-zl 1 to about 

(rln-^71'd from ahnnt TVir ^7fl tn oKnnt Ann QT-C 


t933.aa 


from about Ser-1 39 to about Lys-1 43. 


f940.aa 


from shnnt H-lv 1 A/X tr\ dViAnf A o-*-* AQ 
llUJ.il dUUUL KJxy-L^D LO dDOUL /\Sn-14-o. 


1943. aa 


from about Asp-58 to about Asp-60, from about Lys-1 57 to about 
Asn-159, from about Asp-217 to about Asp-221, from about Lys- 

lu auuui n.su ^jh-, uoiii aoout rro-zoz to aoout Asn-264, from 
about Gly-305 to about Trp-307. 


f952.aa 


from about Ser-52 to about Ser-54 


f4.aa 


from about Arg-64 to about Arg-67. 


143. aa 


trom about Ser-84 to about Gln-87, from about Asp-231 to about 
Tyr-233, from about Arg-296 to about Asp-300. 


ttU.aa 


from about Glu-136 to about Gly-138, from about Asp- 153 to 
about Lys-155, from about Asp-289 to about Asp-291 from about 
Glu-458 to about Asn-461 . 


f65.aa 


from about Glu-120 to about Asp- 122, from about Pro-204 to about 
Tyr-206. 


18. aa 


from about Pro-263 to about Arg-265, from about Asd-274 to about 
Lys-278. 


f82.aa 

] 


from about iyr-66 to about Giy-68, from about Ser-1 16 to about 
-ys-119, from about Asp- 121 to about Glv-123, from about Pro- 
128 to about Gly- 131. 
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r86.aa 


from about Asn-179 to about R1 fmm ^hrmi- t i/c iqo ^ 
about Asn-194, from about Lys-270 to about Asn-272, from about 
Lvs-279 to about Lvs-282 from nhont A^-n ^1 t-ri aKr-oii- Aon q^2q 


r477. aa 


from about Pro-250 to about Lys-253. 


f488.aa 


from ^Hnnt T tr\ cal-\r\nt T xrc HQ -Pr-^-r^ r>u^.-,+. a aozl i 

iiuiii duuui j^yi /o co aooui Lys- /y, trom about Asn-486 to about 

Asp-489, from about Lys-508 to about Gly-510, from about Asn- 
559 to about Gly-562. 


f'494.aa 


from about Lys-76 to about Asn-78. 


f516.aa 


from about Lys-32 to about Asp-34. 


f523.aa 


from about Pro-202 to about Asn-206, from about Lys-255 to about 
Tyr-258. 


f526.aa 


from ?)hnnt Asn-R-* to ahnnt T <s/c QQ -Ft-^t-v-i nk/x^t a i-">^ +~ ±. 

iium duuui oj io aooui i_,ys-oo, rrom about Asp-1 jo to about 
Gly-138. 


i5/7.aa 


from about Cys-18 to about Lys-22, rrom about Asn-297 to about 
Gln-300. 


f584.aa 


from about Pro-131 to about Lys-133, from about Pro-200 to about 
Ser-202. 


f396.aa 


from about Arg-42 to about Asp-44, from about Asp- 1 17 to about 
Tyr-119, from about Pro-205 to about Asp-207. 


f600.aa 


from about Pro- 143 to about Asp- 145. 


±603. aa 


from about Phe-35 to about Ser-37. 


f607.aa 


from about Gln-67 to about Lys-70, from about Asp-273 to about 
Tyr-275, from about Asp-333 to about Gly-338, from about Pro- 
359 to about Lys-362, from about Arg-409 to about Gly-41 1 


f6ll.aa 


from about Arg-133 to about Gly-135. 


f631.aa 


from about Pro-132 to about Asn-136, from about Asn-159 to 
about Tyr-161, from about Pro-216 to about Asp-218, from about 
Pro-220 to about Lys-223 


±'688.aa 


from about Lys-266 to about Asp-268, from about Lys-271 to 
about Asn-273, from about Lys-315 to about Lys-318 


f704.aa 


from about Lys-250 to about Lys-253. 


f/07.aa 


from about Lys-131 to about Asp- 134, from about Asp-246 to 
about Asn-249. 


f709.aa 


from about Tyr-39 to about Gly-42, from about Lys-148 to about 
Gly-150, from about Arg-269 to about Gly-272, from about Ser- 

466 to ar)Onf Tvr—d.f'iR from i^Konf Act-* /1QQ a L A „* a /I o 1 -C 

tw ^ auuui • L / i ^ u °i auuuL tvsu.-'+oy to aoout Asn-H-yi rrom 




u.wu.1 ij^j j i auuuL j^v&p-j / o, irom aoouc jtto-ch-z to about i^ys- 
644. 


U97.aa 


from about Pro-217 to about Asp-219, from about Glu-675 to about 
Asp-678, from about Pro-687 to about Asn-689 from about Glu- 
694 to about Gln-696. 


f200.aa 


from about Arg-174 to about Phe-179. 


f208.aa 


from about Arg-326 to about Ser-328. 


f210.aa 


from about Pro- 191 to about Ee-194. 


f221.aa 


from about Asn-133 to about Asn-135. 


t233.aa 


from about Arg-191 to about Gly-194. 


f269.aa 


from about Ser-2/i to about Thr-273, from about Asp-284 to about 
Gly-286. 


f29.aa 


from about Pro- 159 to about Ser-161. 


1290. aa 


trom about Pro-240 to about Gly-244. 


f291.aa 


from about Gin-267 to about Lys-269. 



m • 
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f296.aa 


from about Glu-98 to about Lys-101. 


f3.aa 


from about Asn-241 to about Lys-245. 


f30 aa 


from about Asn-156 to about Tyr-159, from about Asn-178 to 
about Lys-180. 


f939.aa 


from about Ser-245 to about Asn-249. 




from about Asn-80 to about Tyr-82, from about Lys-208 to about 
Ser-210. 


f742.aa 


from about Ser-141 to about Asp- 145, from about Asn-222 to about 
Gln-225, from about Asp-243 to about Tyr-247, from about Asn- 
249 to about Asn-251. 




from about Arg- 1 1 1 to about Gly- 114, from about Pro- 1 3 1 to about 
Asp- 134. 


f790 aa 


from about Thr-40 to about Asn-42, from about Ser-53 to about 
Ser-55, from about Lys-215 to about Asp-218, from about Asn-274 

rn ahnnt fTlv-977 


f792.aa 


from about Val-82 to about Ser-84, from about Ser-102 to about 
rVoii- ii/^f, lrom auuuL vjriii-iz. / to dOuut i yi-iju, irom duoui i^ys- 
309 to ctt>out A.sn-3 14* from cibout Lys~375 to 3.t)out Tlir-377 from 




about Pro-5 1 1 to about His-513, from about Thr-5 15 to about Asp- 


f797.aa 


from about Pro-119 to about Gly- 122, from about Lys-166 to about 
Asn-169. 


f799.aa 


from about Asn-3 1 to about Asn-34, from about Gln-44 to about 


f814.aa 


from about Ser-120 to about Ser-122, from about Arg-636 to about 
Asn-638, from about Cys-967 to about Ser-969. 


f820.aa 




f850.aa ' 


from about Tyr-159 to about Tyr-164, from about Gln-375 to about 
Asp-379. 


f853.aa 


from about Thr-180 to about Lys-184, from about Arg-231 to about 
Asp-233, from about Asn-252 to about Gly-254. 


f859.aa 


from about Lys-46 to about Ser-52, from about Pro-88 to about 
Asn-91; from about Asn-1 17 to about Asp-120. 


f86 1 .aa 


from about Asp-38 to about Lys-40, from about Lys-219 to about 
Asn-225. 


f368.aa 


from about Gln-228 to about Asn-231. 


f371.aa 


from about Tyr-109 to about Asn-1 1 1 , from about Asn-1 62 to 
about Gin- 164. 


f502.aa 


from about Asn-1 18 to about Lys-122, from about Ser-269 to about 
Gly-27 1 , from about Lys-370 to about Asp-373 , from about Asn- 
509 to about Lys-5 1 1, from about Lys-705 to about Arg-707, from 




about Thr-912 to about Gly-914, from about Pro- 12 13 to about 
Asp-1216, from about Asn-1491 to about Arg-1493. 


f527.aa 


from about Cys-20 to about Gln-22, from about Asn-38 to about 
Asn-40, from about Phe-1 12 to about Asp-1 14, from about Lys-160 
to about Asn-1 62, from about Ser-199 to about Asp-201, from 
about 




Gln-258 to about Gly-261, from about Arg-282 to about Asn-284, 
from about Ser-297 to about Asp-299. 


f541.aa 


from about Ser-68 to about Asn-71. 


f604.aa 


from about Lys-77 to about Gly-79, from about Lys-201 to about 
Asn-203, from about Asp-252 to about Asp-254, from about Tyr- 
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347 to about Gly-350, from about Asp-514 to about Trp-516. 


f736.aa 


from about Lys-20 to about Asn-24, from about Arg-147 to about 
Ser-153, from about Ser-231 to about Lys-233. 


f752.aa 


from about Thr-1 19 to about Lys-122, from about Pro-420 to about 
Gly-422. 


f798.aa 


from about Asp-33 to about Thr-36, from about Lys-180 to about 
His-183. 


f635.aa 


from about Pro-100 to about Asn-102, from about Asp-145 to 
about Phe- 147. 


f32.aa 


from about Lys-18 to about Asn-20. 


f320.aa 


from about Asn-193 to about Leu- 195, from about Gln-248 to 
about Lys-250. 


f352.aa 


from about Ser-46 to about Asn-49. 


f301.aa 


from about Lys-178 to about Lys-180, from about Ser-401 to about 
Tyr-404. . 


f373.aa 


from about Gly-88 to about Lys-90, from about Asn-539 to about 
Lys-542, from about Glu-654 to about Ser-657. 


f384.aa 


from about Pro-250 to about Asn-252, from about Asp-266 to 
about Lys-268. 


f446.aa 


from about Asp-20 to about Ser-26, from about Asn-146 to about 
Lys-149. 


f542.aa 


from about Arg-86 to about Gly-88, from about Arg-163 to about 
Asn-1 65. 


f93.aa 


from about Asn-1 52 to about Asp- 155. 


fl05.aa 


from about Asp-48 to about Phe-50. 


fl50.aa 


from about Thr-214 to about Asp-218, from about Asp-256 to 
about Asp-259. 


f219.aa 


from about Asn-77 to about Asn-81, from about Asp- 11 1 to about 


f229.aa 


from about Gln-61 to about Asn-63. 


f32.aa 


from about Lys-18 to about Asn-20. 


fl86.aa 


from about Lys-305 to about Tyr-308. 


f216.aa 


from about Ser-105 to about Asn-1 07. 


f328.aa 


from about Asn-1 05 to about Asp- 107. 


f352.aa 


from about Ser-46 to about Asn-49. 


f867.aa 


from about Thr-3 to about Gly-5, from about Lys-1 56 to about Ser- 
159. 


f868.aa 


from about Arg-94 to about Gly-96, from about Pro-257 to about 
Gly-261, from about Pro-295 to about Asp-297, from about Arg- 
340 to about Asp-342. 


f872.aa 


from about Ser-19 to about Lys-23, from about T3xr=139 to about 
Asp-142, from about Ser-282 to about Tyr-286, from about Ser-311 
to about Ser-313. 


f886.aa 


from about Thr-83 to about Asp-85, from about Asp- 106 to about 
Lys-108, from about Lys-143 to about Gly-147, from about Asp- 
186 to about Asn-191. 


f888.aa 


from about Asn-65 to about Asp-67. 


f893.aa 


from about Asn-203 to about Asn-207, from about Thr-446 to 
about Asn-450. 


f605.aa 


from about Arg-31 to about Asp-33. 


f606.aa 


from about Asn-68 to about Gly-71, from about Asn-136 to about 



TABLE 4. Residues Comprising Epito-Bearing Fragments 





Lys-139, from about Asn-223 to about Tyr-226, from about Ser- 
276 to about Tyr-279, from about Pro-362 to about Asn-365, from 




about Arg-503 to about Trp-507. 


f679.aa 


from about Lys-154 to about Asp- 156, from about Lys-224 to 
about Arg-226, from about Asn-260 to about Asp-264, from about 
Glu-363 to about Lys-366,. from about Asp-387 to about Gly-389, 
from 




about Tyr-441 to about Lys-443, from about Arg-501 to about Tyr- 
504. 


fll-12.aa 


from about Pro-91 to about Asn-93, from about Pro- 181 to about 
Asp- 186, from about Lys-244 to about Ser-248. 


fll-4.aa 


from about Asn-160 to about Lys-163. 


fl4-8.aa 


from about Pro-92 to about Gln-95, from about Lys-123 to about 
Thr-125, from about Lys-215 to about Asp-219. 


fl7-6.aa 


from about Pro-36 to about Glu-38. 


fl9-2.aa 


from about Ser-104 to about Ser-106, from about Gln-230 to about 
Asn-232. 


fl9-4.aa 


from about Val-79 to about Thr-82, from about Pro- 195 to about 
Gly-201. 


fl9-6.aa 


from about Asp-24 to about Lys-30, from about Pro-36 to about 
Glu-38. 


f21-4.aa 


from about Cys-24 to about Asn-26. 


f28-2.aa 


from about Ser-77 to about Lys-80, from about Tyr-274 to about 
Asn-277. 


f28-3.aa 


from about Glu-53 to about Arg-57, from about Gln-82 to about 
Asn-85, from about Gin- 157 to about Asn-159. 


f31-2.aa 


from about Arg-95 to about Arg-97, from about Asn-297 to about 
Asn-299. 


f4-15.aa 


from about Pro-182 to about Asp-184, from about Lys-220 to about 
Asp-222. 


f4-50.aa 


from about Thr-109 to about Asn-1 1 1. 


f42-l.aa 


from about Asn-55 to about Asn-57, from about Arg-81 to about 
Ser-84, from about Asp-94 to about Asn-97. 


f45-2.aa 


from about Asn-83 to about Gly-86. 


f47-2.aa 


from about Ser-29 to about Asp-33, from about Asn-94 to about 
Lys-99, from about Pro- 152 to about Lys-157. 


f49-2.aa 


from about Asn-452 to about Gly-454. 


f5-14.aa 


from about Glu-102 to about Asp- 106, from about Thr-272 to about 
Asn-275, from about Glu-3 13 to about Asn-3 15, from about Ser- 
370 to about Ser-372. 


f5-15.aa 


from about Lys-170 to about Gly-173, from about Asn-194 to 
about Gly- 196. 


f51-2.aa 


from about Asp-302 to about Lys-304. 


f6-21.aa 


from about Glu-38 to about Asn-42, from about Arg-84 to about 
Gly-87. 


f6-27.aa 


from about Asp-67 to about Asn-69, from about Arg-85 to about 
Asn-89, from about Lys-168 to about Gly-171, from about Lys-179 
to about Asn-181, from about Ser-380 to about His-382. 


f6-5.aa 


from about Ser-67 to about Asn-71. 


f7-30.aa 


from about Pro-94 to about Asp-96, from about Lys-144 to about 
Arg-147. 


f76-l.aa 


from about Asn-30 to about Lys-35, from about Lys-1 13 to about 
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Gly-1 16, from about Glu-1 19 to about Lys-121. 


f8-10.aa 


from about Pro-25 to about Lys-32, from about Ser-168 to about 
Thr-172. 


f01a.aa_bb001 


from about Pro-123 to about Asp-125, from about Ser-179 to about 
Asp- 181, from about Lys-255 to about Gly-259. 


_bb0011 


from about Ala8 about Arg 17, from about Tyr31 to about Gly40, 
from about Ser65 to about Lys78, from about Val93 to about 
Aspl02, from about Serl20 to about lie 129, from about Prol56 to 
about Glul70, from about Lysl87 to about Asn 196, from about 
His205 to about Lys214, from about Gly226 to about Glu235, fro 
about Gln253 to about Asn266, from about Glu283 to about 
Glu293, from about Leu31 1 to about He320, from about Arg326 to 
about Gly335, from about Pro340 to about Ala349 


f02a.aa_bb002 


from about Tyr-169 to about Asn- 171, from about Tyr-242 to about 
Asn-245, from about Lys-264 to about Asp-267. 


_bb9 


from about Met7 to about Lysl6, from about Lys47 to about Ser57, 
from about Asn80 to about Ser89, from about Glyl03 to about 
Glul 13, from about Lysl25 to about Prol33, from about Lysl38 
to about Alal47 


f03a.aa_bb006 


from about Asp-54 to about Thr-57, from about Lys-201 to about 
His-204. 


_bb014 


from about Pro23 to about Gln31, from about Ser37 to about 
Asp45, from about Leu76 to about Asn84, from about Leu76 to 
about Val84, from about Ser89 to about Asn97, from about Serl05 
to about Lysl 13, from about Asnl20 to about Met 128, from about 
Asnl59 to about Gly 167, from about Lysl73 to about Ball81 


_bb023 


from about Asp 17 to about Gly27, from about Arg40 to about 
Asp48, from about Val64 to about Asp72, from about Glul 05 to 
about Thrl 13, from about Serl41 to about Gly 150, from about 
Asp 155 to about lie 163, from about Asn 184 to about Lysl 98, from 
about Ile219 to about Pro227, from about Ser230 to about Phe238, 
from about Ser241 to about Asn250, from about Asp270 to about 
Val278 T from about Ser285 to about Leu293, from about Glyu307 
to about Ser315, from about Ly_s327 to about Asn335 


f08a.aa_bb024 


from about Asn-30 to about Asp-33, from about Ser-1 16 to about 
Asn-118, from about Asn-154 to about Gly- 156. 


f09a.aa_bb025 • 


from about Asn-30 to about Ser-35, from about Thr-145 to about 
Asn- 148. 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1 ; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1 ; or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary to 
any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide having a nucleotide sequence identical 
to a nucleotide sequence in (a) or (b) of claim 1. 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which encodes an 
epitope-bearing portion of a polypeptide in '(a) of claim 1. 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion of a 
polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic acid 
molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 




(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-terminal residue; 

(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

1 1. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to a 
sequence selected from the group consisting of an amino acid sequence of any one of the 
polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an B. burgdorferi 
epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide sequence 
encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

1 6. A vaccine, comprising: 

(1) one or more B. burgdorferi polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutically acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective antibodies 
in an animal to a member of the Borrelia genus. 

17. A method of preventing or attenuating an infection caused by a member of the Borrelia genus 
in an animal, comprising administering to said animal a polypeptide of claim 9, wherein said 
polypeptide is administered in an amount effective to prevent or attenuate said infection. 

18. A method of detecting Borrelia nucleic acids in a biological sample comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1, under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Borrelia nucleic acid 
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sequences present in the biological sample. 

19. A method of detecting Borrelia nucleic acids in a biological sample obtained from an animal, 
comprising: 

(a) amplifying one or more Borrelia nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Borrelia nucleic acid. 

20. A kit for detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

2 1 . A method of detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 
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